Journal of Applied Psychology 


Kenneth E. Clark, Editor
University of Colorado 





Table of Contents 


Donald G. Paterson: 1892-1961 
The Assessment of Creativity in a Research Setting: William D. Buel and Virginia M. Bachner 


Pilot Judgments of Simulated Collisions and Near Misses: A Comparison of Performance with 
Uncoded and Two-Tone Coded Models: John E. Robinson, Jr., Kenneth G. Cook, and 
Charles E. Zeleny 


Variability of Stroke Width within Digits: Charles L. Hughes 


The Interaction of Information Displays with Control System Dynamics in Continuous
Tracking: R. W. Obermayer, W. F. Swartz, and F. A. Muckler


Cross-Validity of Procedures for Selecting Life Insurance Salesmen: Peter F. Merenda,
Walter V. Clarke, and Charles E. Hall


Experimental Evaluation of pee Codes for Console siindanld Frank ee Minor and icine 
L. Revesman.. 


Supervisory Procedures and Work-Team Productivity: J. S. Kidd and R. T. Christy
Bales’ Interaction Process Analysis of Personnel Selection Interviews: Daniel Sydiaha


Protection against Impulse-Type Industrial Noise by Utilizing the Acoustic Reflex: James A.
Chisman and J. Richard Simon


A Factor Analysis of the California Psychological Inventory: John O. Crites, Harold P.
Bechtoldt, Leonard D. Goodstein, and Alfred B. Heilbrun, Jr.


Chance versus Nonchance Scores on the SVIB: Richard R. Stephenson


Visual Target Location as a Function of Number and Kind of Competing Signals: David M.
Promisel


Effect of Personal Characteristics on Relationships between Attitudes and Job Performance: 
Francis D. Harding and Robert A. Bottenberg 


The Stability of Guilford-Zimmerman Personality Measures: Jay M. Jackson 


Relationships between Individual Proficiency and Team Performance and Efficiency: William
M. Wiest, Lyman W. Porter, and Edwin E. Ghiselli





This is the last issue of Volume 45. 
Volume Title Page and Contents appear herein. 





American Psychological Association 


Volume 45, Number 6 December 1961 





Consulting Editors 


Arthur Brayfield, Pennsylvania State
University

George E. Briggs, Ohio State University

Norman Frederiksen, Educational Testing
Service

Leonard D. Goodstein, University of Iowa

Edwin R. Henry, Standard Oil Company of
New Jersey

John Holland, National Merit Scholarship
Corporation

Clifford E. Jurgensen, Minneapolis Gas
Company

Laurence S. McGaughran, University of
Houston

Quinn McNemar, Stanford University

Harold F. Rothe, Fairbanks, Morse and
Company

Thomas A. Ryan, Cornell University

A. C. Welch, University of New Mexico





This journal gives primary consideration to origi- 
nal investigations in any field of applied psychol- 
ogy except clinical psychology, although a de- 
scriptive or theoretical article may be accepted 
if it represents a special contribution in an ap- 
plied field. Quantitative investigations of inter- 
est or value to psychologists working in the fol- 
lowing broad fields will be considered: vocational 
and educational prognosis, diagnosis, and guidance 
at the secondary and college level; personnel re- 
search in business, industry, and government; en- 
gineering psychology; industrial working condi- 
tions; research on opinion and morale factors; job 
analysis and classification research; market and 
advertising research. 


Because of the large number of manuscripts sub- 
mitted, authors should adhere to the rule of 
“brevity consistent with clarity.” The typical 
manuscript should run to approximately 4,000
words. There is a lag of approximately twelve
months between receipt and publication of an 
article. Authors may request advanced publica- 
tion if they are prepared to pay the cost of print- 
ing the necessary extra pages. 


Manuscripts should be addressed to the Editor: 


Dr. Kenneth E. Clark 

Office of the Dean 

College of Arts and Sciences 
University of Colorado 
Boulder, Colorado 


All manuscripts must be submitted in duplicate. 
Original figures are prepared for publication; dupli- 
cate figures may be photographs or pencil-drawn 
copies. 


Manuscripts must conform to the style require- 
ments described in the Publication Manual of the 
American Psychological Association. 





Journal of Applied Psychology 


Published bimonthly by the 
American Psychological Association 
Prince and Lemon Sts., Lancaster, Pa. 
and 1333 Sixteenth Street N.W. 
Washington 6, D. C. 


$10.00 per volume          $2.00 per issue

Arthur C. Hoffman
Production Manager

Helen Orr
Promotion Manager

Subscriptions, orders, and business communications should be addressed to the
American Psychological Association, 1333 Sixteenth St. N.W., Washington 6, D. C.
Address changes must reach the subscription office by the 10th of the month to
take effect the following month. Undelivered copies resulting from address
changes will not be replaced; subscribers should notify the post office that
they will guarantee second-class forwarding postage. Other claims for
undelivered copies must be made within four months of publication.

Second class postage paid at Lancaster, Pennsylvania and at additional mailing places.

© 1961 by the American Psychological Association, Inc.





Journal of Applied Psychology 








Vol. 45, No. 6 DECEMBER 1961








DONALD G. PATERSON

1892-1961


Applied psychology lost one of its pioneers and 
one of the major contributors to its development 
with the death on October 4, 1961 of Donald G. 
Paterson. Returning to this country after partici- 
pating in the fourteenth International Congress 
of Applied Psychology in Copenhagen, he was 
hospitalized in Minneapolis and died of cancer 
after a brief illness. 

Paterson was Editor of this journal for 12 years, 
1943 through 1954, being the first 
to perform this service under the APA’s owner- 
ship and management of the journal. His list of 
publications exceeds 300 titles, including the au- 
thorship or co-authorship of Minnesota Mechani- 
cal Ability Tests (1930), Physique and Intellect 
(1930), Men, Women, and Jobs (1936), Student 
Guidance Techniques (1937), How to Make Type 
Readable (1940), The Minnesota Occupational 
Rating Scales and Counseling Profile (1941), Lo- 
cal Labor Market Research (1948), Revised Min- 
nesota Occupational Rating Scales (1953), and 
Studies in Individual Differences (1961). It is 
characteristic of Paterson’s mode of working and
of his constant involvement with his many students
that many of his books and articles were
published jointly with students and colleagues. 
That his methods resulted in research produc- 
tivity is demonstrated by the list of about 1,400 
titles of articles published by his students which 
was prepared and published at the University of 
Minnesota at the time of Paterson’s retirement 
in 1960. 
Donald G. Paterson was born on January 18, 1892. He obtained his
undergraduate and graduate training at the Ohio State University. Prior to
World War I, he was an Instructor in psychology at the University of Kansas,
where he met and married Margaret Young. His wife and their two children,
Philip Paterson and Mrs. Robert C. Becker, survive him. From 1917 through
1919 he was Chief Psychological Examiner and a Captain in the Sanitary Corps
of the United States Army. For the next 2 years he was a member of the Scott
Company, the first of psychological consulting organizations.

Paterson moved to the University of Minnesota in 1921, and in 1923 was
promoted to Professor. He retired in 1960 after 39 years of service. In this
interval he became a leading figure in psychology nationally, in his own
community, and in the University of Minnesota. He pioneered in the whole
advance of student personnel work, vocational counseling, industrial and
personnel psychology, and differential psychology. He played a major role in
the 1930s and 1940s in the establishment of the University of Minnesota’s
Student Counseling Bureau, the Minnesota Employment Stabilization Research
Institute, and the university's Industrial Relations Center. He was a founder
and President of the American Association of Applied Psychology, and served as
Secretary of the American Psychological Association for 6 years. He was a
Diplomate in industrial psychology of the American Board of Examiners in
Professional Psychology. He received an honorary LLD degree from Ohio State
University in 1952, and in 1956 was selected to deliver the Walter V. Bingham
lecture at his alma mater.

Since 1921, approximately 300 students have earned their MA degrees, and 88
students their PhD degrees, with him as their major adviser. Additional
thousands of undergraduate and graduate students have taken his courses in the
psychology of individual differences, vocational and occupational psychology,
and occupational counseling. Hundreds of his students, currently working in
education, government, and industry, are applying their knowledge and engaging
in research in areas of applied psychology. His imprint on applied psychology
will continue to be seen in the work of these students, and in the
contributions he has made to the knowledge of psychological principles and
their applications in all areas of life.





Journal of Applied Psychology 


1961, Vol. 45, No. 6, 353-358 


THE ASSESSMENT OF CREATIVITY IN A RESEARCH SETTING 


WILLIAM D. BUEL
Science Research Associates, Chicago, Illinois

AND VIRGINIA M. BACHNER
Pure Oil Company, Palatine, Illinois


Recently, Stein (1960a, 1960b), Sprecher (1959), Smith, Albright, Glennon,
and Owens (1961), and Buel (1960) have presented both material and methods
directed to the problems of the assessment and identification of creativity
among research personnel in the industrial setting. These studies illustrate
several methods of studying creative performance: environmental surveys,
descriptive psychometrics, personal history information, and behavioral
rating scale items validated for the purpose of differentiating between
relatively more or less creative research personnel. The purpose of the study
reported here was to investigate the descriptive and predictive validity of
several psychometric instruments and a locally constructed forced-choice
scale.


METHOD

Subjects

The subjects in this study were research personnel in the research center of
a major cereal and animal feed company.¹ The subjects were selected with
regard to whether their assignments were considered to be in an endeavor
which called for creative behavior, regardless of whether they themselves
were considered to be creative. In this manner 54 persons (48 males and 6
females) were selected for study.² The areas of technical endeavor
represented by these persons were cereal, animal nutrition, and organic
research, with training in chemistry and biology as a common background.
Fifteen held the doctor's degree, 4 held the master’s degree, 28 held the
bachelor’s degree, and 18 others had training and experience in their field
but did not hold a degree. The age of the group ranged from 22 to 59 with a
mean of 34.9 years. Length of service with the research center ranged from 6
months to [illegible] years with a mean of [first digit illegible].5 years.


Raters

Twenty-two raters evaluated the research personnel by means of the
Forced-Choice Instrument. Twenty-one of these raters also served as ratees
for some other rater. The number of ratees assigned to any rater ranged from
1 to 7. All 65 subjects were evaluated with the Forced-Choice Instrument. In
addition, the codirectors of the research center participated in an over-all
ranking of the 65 persons on a criterion to be discussed below.

¹ The authors wish to thank Robert G. Coy and the other members of the Quaker
Oats Company Research Center who made this study possible.

² An N of 65 is reported upon later in this paper for purposes of
forced-choice validity and reliability.


Psychometric Instruments

The psychometric instruments employed in this study were as follows:

1. Adaptability Test, Form A (12 persons) and Form B (42 persons). The
   females responded to Form B. Forms A and [...] (Tiffin & Lawshe, 1954).
2. Thurstone Temperament Schedule.
3. Guilford-Martin Personnel Inventory.
4. Allport-Vernon-Lindzey Study of Values, Revised Edition.
5. Kuder Preference Record.


Forced-Choice Instrument

The items [...] were based on a previous study [...] in a petroleum research
center. The [...] competence, methodology, technical application, personal
objectivity, approach to complexity, perseverance, energy level, synthesizing
ability, uniqueness of approach, personal confidence, use of symbolism and
analogy, etc. Since the format of this instrument has not previously been
described in the literature, it is briefly described here.

As previously indicated, the items were behavioral indices in nature. The
preference values were in the form of the arithmetic mean of the values (1-5)
which had been assigned [...] by raters at a time when those items were in
[...] form and were used to describe research personnel. The item
discrimination indices were in the form of Pearsonian coefficients calculated
against a hypothetical criterion of [...] identical in organizational
reference [...] study.

The development of the tetrads was accomplished by matching two items on high
preference value and two items on low preference value, one item in each pair
having significant differentiating ability (.05 level) and one item in each
pair having little or no differentiating ability. In addition to this form of
matching, an attempt was made to match each pair of items on standard
deviation (Buel, 1959). Finally, an attempt was made to match pairs of high
and low preference value items on item length. In this way controls were
imposed on central tendency, variability, and the desirability or
attractiveness of an item as a function of length or brevity. The final
forced-choice scale consisted of 13 tetrads, the items in each of which were
to be ranked (1-4) in terms of their descriptiveness of the person being
evaluated. Rundquist (1950) has recommended such a [...], and it is at least
reasonable to assume that the unpleasantness of forcing a choice is
alleviated by this approach. The scoring of a scale in this form is achieved
by scoring only Rank 1 and Rank 4, synonymous to scoring the most and least
descriptive items.

Procedure

The testing of personnel was accomplished by a qualified psychologist between
6 and 9 months prior to the inception of this study. All persons tested were
members of the research center staff at the time of testing and had not been
selected for employment by means of these tests. The forced-choice
evaluations and the criterion rankings were performed at the beginning of
data collection. All test and rating results were kept confidential.

Criteria

Three criteria of creativity were employed:

1. A hypothetical criterion [...]. The legible portions of the instructions
given to the criterion raters read:

   You have been selected to [...] director [...] for another cereal and feed
   company. [...] the Quaker Oats Company Research Center has [...] take
   [...] with you to your new organization [...] persons now at the Quaker
   Oats Company Research Center. You have [...] by your new management to
   bring [...] persons who will make the most [significant, original], and
   lasting contributions [...].

In accordance with these instructions, the 65 persons [...] were divided into
five subgroups of 13 per group, Group A being those most described by the
criterion, Group [B] those next most described, and so on down [...]. The
persons were then ranked 1 through 13 within each group. These letter-number
combinations were later converted into one ranking from 1 through 65, thus
obviating the problem of continuously ranking 65 persons.

Employing the technique proposed by Hull, each subject’s rank score was
converted into a linear scale score, amenable to treatment by linear
correlation methods. The correlation between criterion raters was .43.
Consequently, the average of [...] linear scale scores served as his
criterion score.


2. Creativeness: a subsection of the company appraisal form, which was
defined as follows:

   Ability to approach problems with an inquiring mind, with vision and
   imagination, to develop new ideas and improved methods, and to provide
   original solutions to problems.

Ratings on this criterion were in the form of values from 1 to 3 and were
assigned as part of the yearly evaluation of personnel. The raters who
assigned these values were in some instances the same persons who rated with
the Forced-Choice Instrument. However, the 9 to 12 months between the two
evaluations reduced any biasing effects as a function of memory of previous
ratings.


3. Proportional Contribution to Patents Issued: wherein each single-author
patent was scored as one point; if authored by more than one person, the
point was divided equally among the several authors.

The first criterion discussed was collected after the forced-choice
evaluation of the participants, while the second criterion evaluation had
been performed prior to the inception of the study. The third criterion was
obtained from personnel records. Because of the temporal order in which the
criteria were collected, relatively little bias is assumed to have crept in
by way of test scores or forced-choice evaluations.
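
As a rough illustration of two of the criterion computations described above, the following sketch pools the subgroup rankings of Criterion 1 into a single ranking from 1 through 65 and scores Criterion 3 by dividing each patent's point equally among its authors. The function names, the example author names, and the letter-to-offset mapping are assumptions made for illustration. The paper cites Hull's technique without reproducing it, so `linear_scale_score` is only a stand-in that assumes a normal-deviate conversion of percent position; it is not claimed to be the table the authors used.

```python
from fractions import Fraction
from statistics import NormalDist

GROUP_SIZE = 13      # five subgroups of 13 persons each (from the text)
GROUPS = "ABCDE"     # Group A = persons most described by the criterion


def pooled_rank(group: str, within_rank: int) -> int:
    """Convert a (group letter, rank within group) pair to a rank from 1 to 65."""
    if group not in GROUPS or not 1 <= within_rank <= GROUP_SIZE:
        raise ValueError("unknown group or within-group rank out of range")
    return GROUPS.index(group) * GROUP_SIZE + within_rank


def linear_scale_score(rank: int, n: int = 65) -> float:
    """Assumed stand-in for Hull's conversion: normal deviate of percent position."""
    proportion_below = 1 - (rank - 0.5) / n   # rank 1 receives the highest score
    return NormalDist().inv_cdf(proportion_below)


def patent_credit(patents):
    """Score each patent as one point, split equally among its authors."""
    credit = {}
    for authors in patents:
        share = Fraction(1, len(authors))
        for author in authors:
            credit[author] = credit.get(author, Fraction(0)) + share
    return credit


if __name__ == "__main__":
    print(pooled_rank("B", 3))   # third person in the second subgroup
    print(patent_credit([["Smith"], ["Smith", "Jones"]]))
```

Exact fractions are used for the patent shares so that, for example, a three-author patent contributes exactly one third of a point to each author rather than a rounded decimal.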


Analysis

[...] Concurrent validity coefficients resulted from correlating the
predictors with the criteria [...]. Split-half correlations [...] reliability
of [...] were obtained and increased by application of the Spearman-Brown
formula [...] approximation [...] uneven split [...] is specified in the
[...]. Where necessary, adjustments were made in the ranks assigned by the
codirectors, and subsequently in the linear [...], to account for the reduced
number [...]. All correlations are in the form of Pearson product-moment
coefficients.
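
The step-up of a split-half correlation mentioned in the Analysis can be sketched as follows. This is the textbook Spearman-Brown prophecy formula, offered as an illustration and not as a reconstruction of the authors' worksheet; the function name and the example value of .50 are assumptions. With 13 tetrads the halves cannot be equal in length, so the doubled-length form is presumably what the surviving words "approximation" and "uneven split" refer to.

```python
def spearman_brown(r_half: float, length_factor: float = 2.0) -> float:
    """Estimate full-test reliability from the correlation between two parts.

    length_factor is the ratio of full-test length to part length; 2.0 is the
    usual split-half case.
    """
    return length_factor * r_half / (1 + (length_factor - 1) * r_half)


if __name__ == "__main__":
    # Hypothetical split-half correlation of .50, stepped up to full length.
    print(round(spearman_brown(0.50), 3))
```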


RESULTS AND DISCUSSION

Inspection of Table 1 leads to some interesting observations concerning the
application of standardized psychometric material for discriminating between
various levels of creative ability. Persons whose research contributions are
“significant, original, and lasting” (Criterion 1) tend to be active,
dominant, and cooperative; are interested in literary pursuits and are
uninterested in clerical pursuits. Further, they are quite effectively
identified by the Forced-Choice Instrument. Persons who are considered to
possess “creativeness” (Criterion 2) are more intelligent, are interested in
things literary, and are quite easily identified by means of the
Forced-Choice Instrument. Relative to the number of “patents issued”
(Criterion 3), persons with literary interests as well as those identifiable
by means of the Forced-Choice Instrument tend to patent.

TABLE 1
COEFFICIENTS
[Table body not recoverable. Legible row labels: Adaptability Test; Thurstone
Temperament Schedule (Active, Vigorous, Impulsive, Dominant); Cooperative;
Allport-Vernon-Lindzey Study of Values.]

Previously reported work lends credence to some of the findings reported
here. In relation to activity, Barron (1957) demonstrates that original
persons tend to be more “energetic” whereas unoriginal persons tend to be
“slow” and “apathetic.” Elsewhere Barron (1955) hypothesizes and demonstrates
that “Dominance” is characteristic of original persons.





Maizell (1958), investigating the reading habits of research chemists,
suggests that the more creative are given to greater amounts of technical
reading on the job. Albright and Glennon (1961) find essentially the same
tendency when differentiating between researchers on a “research” vs.
“administrative” career aspiration criterion, while Roe (1952) describes
eminent scientists as having developed a childhood habit of prodigious
reading. Maschino (1959), considering the factors related to success in
technical jobs, presents evidence to the effect that successful technical
personnel are uninterested in clerical work and, as demonstrated in numerous
other research reports, are significantly more intelligent. Cooperativeness,
although not specifically corroborated by the work cited here, gains
significance if one chooses to construe Ascendance (Barron, 1957) and
Tolerance (Gough & Woodworth, 1960) as components thereof. Conversely, Roe’s
work relative to the researcher’s need for personal independence (1952) and
lack of personal relations skills (1956) would seem to be contradictory. The
fact that the correlation of Cooperativeness with Criterion 3 (patents), a
relatively objective criterion, is not significantly different from zero, may
indicate that rater and subject bias in favor of a socially desirable
characteristic like Cooperativeness has contributed to the magnitude of the
correlations between it and two somewhat subjective criteria.

TABLE 2
INTERCORRELATIONS AMONG SIGNIFICANT PREDICTORS
(N = 54)
[Table body not recoverable. Legible predictor labels: Adaptability Test;
Thurstone Active; Thurstone Dominant; Guilford-Martin Cooperativeness; Kuder
Literary; Kuder Clerical; Forced-Choice Instrument.]

Table 2 presents the matrix of intercorrelations among the variables singled
out for consideration as discriminators between relatively creative and
noncreative research persons. It is apparent that, with the possible
exception of Dominance and Clerical (-.50) and Dominance and the
Forced-Choice Instrument (.31), the discriminating measures are independent
of one another. Considering Active, Cooperativeness, Literary, and Clerical
(Dominance contributes nothing because of high intercorrelations) as a
predictive battery for Criterion 1 leads to a multiple correlation of .47
(N = 54), shrunken to .42 (Guilford, 1950; Kelley & Salisbury, 1926),
significant at the .01 level of confidence. Selecting the Adaptability Test
and Literary as predictors of Criterion 2 results in a multiple correlation
of .43, shrunken to .41, significant at the .01 level (N = 42).

Although the authors have heretofore treated the Forced-Choice Instrument as
a predictor of various criteria, it is equally justifiable to view it in the
role of a criterion. Using the Forced-Choice Instrument (N = 54) as a
criterion results in a validity for Dominance of .31 (.04 level) and for
Theoretical of .27 (.06 level), correlating between themselves -.03
(presented here but not in tables). Combining the two eventuates in a
multiple correlation of .41, shrunken to .39 (.01 level). It appears that,
even within the limits of this study, the Forced-Choice Instrument is a
reasonable criterion.
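
The "shrunken" multiple correlations reported above can be checked with a short sketch. The paper cites Guilford (1950) and Kelley and Salisbury (1926) but does not print the correction it applied, so the formula below is an assumption: a shrinkage correction of the form 1 - (1 - R²)(N - 1)/(N - k), with k taken as the number of predictors. Under that assumption it reproduces the three reported values to two decimals. The function name is illustrative.

```python
import math


def shrunken_r(r: float, n: int, k: int) -> float:
    """Shrink a multiple correlation r obtained from n cases and k predictors.

    Assumed form: R_s^2 = 1 - (1 - R^2) * (n - 1) / (n - k).
    A negative corrected R^2 is reported as zero.
    """
    corrected_r2 = 1 - (1 - r ** 2) * (n - 1) / (n - k)
    return math.sqrt(max(corrected_r2, 0.0))


if __name__ == "__main__":
    # (R, N, number of predictors) as given in the text.
    for r, n, k in [(0.47, 54, 4), (0.43, 42, 2), (0.41, 54, 2)]:
        print(r, "->", round(shrunken_r(r, n, k), 2))
```

The correction grows with the number of predictors relative to the sample size, which is why the four-predictor battery loses more (.47 to .42) than the two-predictor combinations.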


Table 3 presents the correlations between the criteria. Criterion 3 (patents)
is more independent than Criterion 2 (creativeness), and Criterion 1
(hypothetical) is the least independent of the three, suggesting that it may
be the most representative criterion. Criterion 3 is relatively independent
but at the same time reflects the possibility that it is apparently not
measuring creativity as conceived of by research directors and supervisors.
The lower correlations between Criterion 3 (patents) and the other two
criteria may reflect the many unaccounted-for variables involved in securing
patents or, perhaps, the entrepreneurship qualities operating in research
personnel who obtain patents.

TABLE 3
CRITERION INTERCORRELATIONS
[Table body not recoverable. Legible labels: Criterion 1 (Ranking);
Criterion 2 (Creativeness).]

To use a patent criterion in such a study may be suspect when one recalls
that certain of the subjects had been members of the research center staff a
relatively short time. Obviously such persons, even if creative, had not had
a chance to patent within this organization. Patents were included as a
criterion more because of availability than because of their purity as a
measure of creative ability, even had all persons been staff members long
enough to make such a criterion equitable. The authors feel that mere
profusity of patents is not a true indicator of creative ability.

Future investigations might develop the merit of patents as criteria if the
differential values of patents were considered. If some method of quantifying
the value of individual patents (perhaps by a panel consisting of the patent
author and several experts) were developed, a more meaningful patent
criterion might result.

With regard to the results of this study, subtests which approach
significance for one or several criteria, but which have not previously been
discussed, might bear further scrutiny and item analysis. Stable, Reflective,
Objective, Agreeableness (negative), Artistic (negative), and Religious
(negative) might well be developed into significant indicators of the
personality and value structure of creative persons.


SUMMARY

Psychometric instruments of varying nature were used to evaluate research
personnel in a major cereal and feed company research center. Three criteria
of creativity were collected against which the psychometric instrument
validities were calculated. Certain intelligence, interest, and personality
variables and the Forced-Choice Instrument were demonstrated to be valid,
singly and in combination, for differentiating between relatively creative
and noncreative research workers.


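REFERENCES

Albright, L. E., & Glennon, J. R. Personal history correlates of physical
    scientists' career aspirations. J. appl. Psychol., 1961, 45, [...].
Barron, F. The disposition toward originality. J. abnorm. soc. Psychol.,
    1955, [...].
Barron, F. [...], 1957.
Buel, W. D. [...] forced-choice rating scale [...], 1959.
Buel, W. D. [...] items for [...] assessment [...]. J. appl. Psychol.,
    1960, [...].
Gough, H. G., & Woodworth, D. G. Stylistic variations among professional
    research scientists. J. Psychol., 1960, 49, 87-98.
Guilford, J. P. Fundamental statistics in psychology and education. New
    York: McGraw-Hill, 1950.
Kelley, T. L., & Salisbury, F. S. An iteration method for determining
    multiple correlation constants. J. Amer. Statist. Ass., 1926, 21,
    28[...].
Maizell, R. E. The most creative chemist reads more. Industr. Engng. Chem.,
    1958, 50, 64A-65A.
Maschino, [...]. Factors related to success in technical jobs. Midland,
    Mich.: Dow Chemical, 1959. (Mimeo)
Roe, A. A psychologist examines sixty-four eminent scientists. Scient.
    Amer., 1952, 187, 21-25.
Roe, A. The psychology of occupations. New York: Wiley, 1956.
Rundquist, E. A. Personality tests and prediction. In D. H. Fryer & E. R.
    Henry (Eds.), Handbook of applied psychology. New York: Rinehart, 1950.
    Pp. 182-191.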



Smith, W. J., Albright, L. E., Glennon, J. R., & Owens, W. A. The prediction
    of research competence and creativity from personal history. J. appl.
    Psychol., 1961, 45, 59-62.
Sprecher, T. B. A study of engineers’ criteria for creativity. J. appl.
    Psychol., 1959, 43, 141-148.
Stein, M. I. The research environment survey. Chicago: Science Research
    Associates, 1960. (a)
Stein, M. I. The survey for administrators. Chicago: Science Research
    Associates, 1960. (b)
Tiffin, J., & Lawshe, C. H. Examiner manual for the Adaptability Test.
    Chicago: Science Research Associates, 1954.

(Received December 9, 1960)





Journal of Applied Psychology
1961, Vol. 45, No. 6, 359-3[illegible]


PILOT JUDGMENTS OF SIMULATED COLLISIONS AND NEAR MISSES:
A COMPARISON OF PERFORMANCE WITH UNCODED AND
TWO-TONE CODED MODELS


JOHN E. ROBINSON, JR., KENNETH G. COOK, AND CHARLES E. ZELENY

Applied Psychology Corporation


Aircraft pilots agree that avoiding midair collisions in Visual Flight Rules
weather involves more than merely sighting the intruder aircraft; but they do
not agree on just how much more: how many kinds of information are needed and
what degrees of precision are required (Calvert, 1958; Robinson, 1959). This
question continues to arouse arguments in ready-rooms, at manufacturers’
conferences, and at flight safety meetings and regulatory deliberations.

Projector and Robinson (1958), in an analytical survey of aircraft navigation
light systems, list eight kinds of information mentioned in aviation
literature as important, to some unspecified degree, in avoiding collisions
during “good” weather conditions (Visual Flight Rules):

Presence or [...]
Identification
Attitude
Relative altitude
[remaining items not legible]

Presumably, it was felt that if a pilot could get these kinds of information
about “intruder” aircraft, he would be able to make faster and more accurate
decisions about the possibility of collision, and thus, whenever avoidance
was necessary, make the correct maneuver sooner and more accurately.

A research approach to the evaluation of this basic concept would seem to
begin with the measurement of visual perception of an object moving through
three-dimensional space. A first task would be to measure the speed and
accuracy of visual perception of simple or elemental objects. Subsequently,
perception of objects with more complex visual characteristics would be
measured after additional items of visual information had been provided.



Aircraft shape has been shown to be advantageous in the perception of
relative motion when compared to the relative motion of a spherical object
(Kelley, Bowen, De Groot, Frank, & Channel, 1959). Individual observers were
seated in a corner of a black curtained “room.” A visible stimulus object
whose support was invisible was moved across the room in a random sequence of
straight-line directions. The observer had to indicate his perception of the
relative motion of the object. His answers were scored on the basis of speed
and accuracy. The first object was a small, silver painted sphere. Very
little visual information was available from the shape of the stimulus, and
the scores reflected certain amounts of erroneous relative motion judgment.
It was reported that when the sphere was replaced with a small, silver
painted aircraft model, judgments of relative motion were made significantly
faster and more accurately.

The experiment reported in the present text was designed to provide the
answer to the next question in this research progression: Will the relative
motion of aircraft models painted half dark and half light be perceived
sooner and more accurately than the relative motion of models painted all
light?

The answer was sought by presentation of a number of simulated collisions and
near-collisions, using an F-100 flight simulator and an F-151 aerial gunnery
trainer attachment. Pilots who served as subjects used apparatus which made
continuous records of their judgments during the simulated flight situations.
The three judgments possible were “Undecided,” “Collision,” and “Miss.”

The paint patterns used on the intruder aircraft models were in no way
intended to presage operational paint schemes for full-sized aircraft.
Rather, they were designed to code certain visual aspects of the aircraft to
see if such coding would help pilots make better judgments than could be made
with an uncoded model. If the aspect information had turned out to be of
value, application would have been developed for both daytime operations
(paint) and nighttime operations (lights).

[Footnote largely illegible; legible fragment: FAA BRD]



PROCEDURE 


Each pilot who participated in this study solved his test problems
while seated in the cockpit of an F-100 jet fighter in one of the
simulator rooms at the Federal Aviation Agency’s National Aviation
Facilities Experimental Center, Atlantic City, New Jersey. In front of
this cockpit was a concave plaster screen, approximately two stories in
height and extending 240 degrees in azimuth. The general level of
lighting represented moonlight conditions (7 to lf foot-lamberts
ambient light). No terrain features were visible, but a fuzzy horizon
was distinguishable at the border of the “moonlit” area.

A closed circuit television system was used to project an image of a
B-47 scale model aircraft in a series of different courses relative to
the course of the cockpit. These included flight towards the F-100 and
away from it on one of six relative headings. Relative altitude of the
courses was also varied, with the B-47 image being made at various
times to appear to be ascending towards, descending upon, or flying
level with the F-100. The outcome was sometimes a “collision,” at other
times a “near-collision,” and at others a clear miss.

The “intruder” image appeared on the screen as a white area with a
brightness of .55 foot-lamberts. The brightness of the “sky” (the area
of the screen above the horizon line) was .1 foot-lambert. The maximum
simulated range was 24,000 feet, at which distance the television image
of the B-47 appeared only as a white dot. At distances closer than the
minimum possible range (606 feet) the intruder image disappeared.

A separately located television camera was focused on the scale-model
aircraft, which by means of a mount could be moved various distances
from the camera to simulate changes in the intruder’s distance.

A gimballed suspension system permitted maneuver of the scale model in
pitch, roll, and yaw. The amount and speed of rotation in each of these
three flight dimensions, or any combination of them, were controlled by
analog computer equipment. Complex driving mechanisms assured that the
apparent motion and maneuvers of the intruder image were faithfully
maintained with respect to the cockpit in which the pilot sat.

The fact that computer settings could be made to start a predetermined
flight situation, and carry it to completion, made it possible to have
standardized test problems. Thus, the flight situations selected could
be presented repeatedly with reasonable assurance that successive
presentations were identical.

Six FAA pilots (two military, four civilian) served as subjects. All
had had flight experience greatly in excess of the 1,000-hour minimum
which the investigators established. All were actively engaged in
flying at the time of the experiment, and the nature of their duties at
the experimental center supports the belief that all possessed more
than average interest in experimentation aimed at improving air safety.


Each subject was provided with 2 hours of orientation and practice
prior to participating in four test sessions. Each test session
contained 6 practice problems and 12 test problems involving one model.
The order in which the models were assigned to test sessions as targets
was counterbalanced to prevent bias from learning or practice effects.
With individual problem times varying from 34 to [...] seconds, plus
the time required to reset the computer for each new problem, single
test sessions lasted approximately 50 minutes.


As soon as a problem was ready for presentation, the experimenter
alerted the pilot (“Your next problem will appear soon at approximately
1¢ o'clock and level”). The pilot was not required in any case to make
evasive maneuvers, but instead was asked only to judge whether or not
the B-47 image would “collide” with his aircraft, assuming that speeds
and flight paths would remain constant.

By pressing one of three keys on a board strapped to his knee, the
pilot indicated either that he was or was not on a collision course, or
that he was undecided. The keys were wired to a pen recorder in the
control room. The resulting pen-trace record provided a means of
measuring the time and nature of each judgment. A second pen, wired in
circuit with the master switch, recorded the exact starting time of
each problem.

The intruding B-47 images were visually coded in four separate
patterns. One of the scale models (Model A) was painted entirely white,
therefore requiring the pilot to make his collision judgments only on
the basis of apparent aircraft structure and relative motion. The other
three models were painted in two-tone schemes, as follows:

Model B: front half white, back half dark
Model C: top half white, bottom half dark
Model D: left half white, right half dark


The complete experiment consisted of two presentations each of 6 test
problems for each of the four intruder models. The total number of
problems available for analysis from the six pilots thus came to 288.
(The 144 practice problems were not included in the analysis of
results.)
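
The totals quoted above follow directly from the design figures given in this section. A minimal sketch of the arithmetic is shown here; the function and constant names are illustrative only and do not come from the original report.

```python
# Design figures as stated in the Procedure section.
PILOTS = 6
SESSIONS_PER_PILOT = 4        # one session per intruder model (A-D)
TEST_PER_SESSION = 12         # two presentations of 6 test problems
PRACTICE_PER_SESSION = 6


def problem_totals(pilots, sessions, test_per_session, practice_per_session):
    """Return (test_total, practice_total) summed over all pilots and sessions."""
    test_total = pilots * sessions * test_per_session
    practice_total = pilots * sessions * practice_per_session
    return test_total, practice_total


test_total, practice_total = problem_totals(
    PILOTS, SESSIONS_PER_PILOT, TEST_PER_SESSION, PRACTICE_PER_SESSION
)
print(test_total, practice_total)  # 288 144
```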

Pilot judgments were scored in terms of accuracy and speed. Accuracy
scoring consisted of “correct” and “incorrect” categories, but these
were further classified in terms of initial judgments (decisions made
earliest in each problem) and final judgments (the decision that was
being indicated at the end of each problem).


For scoring of speed, five categories were delineated: (a) “Search”
time (from start of problem to beginning of Undecided trace), (b)
“Undecided” time (total time the pen trace was in Undecided area), (c)
time to reach initial collision judgment (the sum of Search and
Undecided times), (d) “Correct Response” time (total time the pen trace
was indicating the correct problem outcome), and (e) “Incorrect
Response” time (total time the pen trace was indicating an incorrect
problem outcome). Because there were different time lengths among the
36 problems, actual time values were transformed into proportions of
total problem time, and analysis was based on the proportional values.
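
The five speed categories and the conversion to proportions of total problem time can be sketched as follows. This is only an illustration under stated assumptions: the study used a pen recorder, so the event-list format, the function name `score_trace`, and the rule that Search time ends at the first key press of any kind are all hypothetical.

```python
def score_trace(events, duration, outcome):
    """Score one problem as proportions of total problem time.

    events   -- list of (time_sec, key) pairs sorted by time, where key is
                "undecided", "collision", or "miss" (hypothetical format)
    duration -- total problem time in seconds
    outcome  -- the true problem outcome, "collision" or "miss"
    """
    if not events:
        # No key was ever pressed: the whole problem counts as Search time.
        return {"search": 1.0, "undecided": 0.0, "correct": 0.0,
                "incorrect": 0.0, "to_initial": 1.0}

    # (a) Search time: from problem start to the first key press (assumption).
    seconds = {"search": events[0][0], "undecided": 0.0,
               "correct": 0.0, "incorrect": 0.0}

    for i, (start, key) in enumerate(events):
        # Each key stays in force until the next press or the problem end.
        end = events[i + 1][0] if i + 1 < len(events) else duration
        if key == "undecided":
            category = "undecided"        # (b)
        elif key == outcome:
            category = "correct"          # (d)
        else:
            category = "incorrect"        # (e)
        seconds[category] += end - start

    # (c) Time to initial judgment: sum of Search and Undecided times.
    seconds["to_initial"] = seconds["search"] + seconds["undecided"]

    # Transform actual times into proportions of total problem time.
    return {name: value / duration for name, value in seconds.items()}


example = score_trace(
    [(4.0, "undecided"), (10.0, "miss"), (30.0, "collision")],
    duration=40.0,
    outcome="collision",
)
print(example)
```

In the example, the pilot searches for 4 seconds, is undecided for 6, indicates “miss” for 20 (incorrect), and ends on “collision” for 10 (correct), so the initial judgment is wrong and the final judgment is right.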


After all the pilots had completed all test sessions, individual
interviews were conducted to obtain information relevant to the data
that had been collected. Topics included in the Interview Guide were:
adequacy of training, convenience of response apparatus, realism of the
visual presentations, nature of paint-coding that yielded best
judgments, chances of correctly anticipating particular flight
situations, appropriateness of test session length, and bases on which
judgments were made.


RESULTS 


The analysis of the data resulting from the experiments can be
discussed conveniently under three topics: accuracy of judgments, speed
of judgments, and effect of flight situation.

Table 1 summarizes the results of scoring the pilot judgments for
accuracy according to individual pilots and individual target paint
schemes. The numbers of correct judgments, both initial and final,
indicate that the pilots were highly skilled. Of the 288 initial
judgments, 201 (70%) were correct, while 261 (91%) of final judgments
were right. The subjects, as individuals, varied widely in accuracy of
initial judgments, ranging from 46% to 83% correct. On final responses,
however, the six pilots were very nearly equally correct, ranging from
85% to 98%. The variation in response accuracy classified according to
the four paint-coded models is small, both for initial and final
judgments. Correct initial judgments ranged between 64% and 75%, while
on final judgments the correct responses ranged from 89% [...] the 288
problems, [...]

TABLE 1
ACCURACY OF PILOTS’ JUDGMENTS
(Column headings: model paint code: White, Front-back, Top-bottom,
Left-right. The table body is not legible in the source.)
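
The overall accuracy percentages quoted for Table 1 can be checked from the counts given in the text. The helper name below is illustrative and not part of the original report.

```python
def percent_correct(correct, total):
    """Percentage of correct judgments, rounded to the nearest whole percent."""
    return round(100 * correct / total)


TOTAL_PROBLEMS = 288
print(percent_correct(201, TOTAL_PROBLEMS))  # initial judgments: 70
print(percent_correct(261, TOTAL_PROBLEMS))  # final judgments: 91
```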


Examination of the results of the pilot judgment records for time of
decision reveals an interesting pattern of variability. Four of the six
pilots required nearly identical amounts of time to detect the targets,
while two pilots were markedly faster than the others. When Search time
and Undecided time are summed, a considerable spread in proportion of
problem time (23-69%) was evident among the six pilots. The variation
in the proportion of time spent in making Correct judgment responses
ranged from 27 to 54%; the time spent in making Incorrect responses
varied from 4 to 20% of total problem time. Only in the case of
Undecided time was the variability among pilots statistically
significant beyond the .01 level. The different paint-codings did not
cause statistically significant variation in the proportion of total
problem time spent on any response category except for Search time.
Here, all of the two-tone models show longer times than the all-white
model. This undoubtedly resulted from the fact that the “dark” portion
of the two-tone models (which was in actuality a mottled gray) reduced
the total brightness of these images as compared with the brightness of
the all-white image.

Examination of scores to determine the effect of relative flight path
on the accuracy and speed of pilots’ judgments reveals that this
variable did not appear to affect the responses significantly. Although
individual differences occur among pilots and among categories of data,
the variability does not reach statistical significance at conventional
levels. Consideration of the variability in practical or operational
terms similarly shows no difference among responses to the several
targets that would warrant support for the two-tone codings as
beneficial to the visual evaluation of collision threats.

The amount of vertical separation in flight path (at the completion of
the problems) appears to have a general relation to the speed of
pilots’ judgments. That is, examination of the Undecided time reveals
that the greater the vertical miss, the shorter the indecision. The
closest vertical miss (4 feet) and both of the horizontal crossover
misses required the longest times for making decisions.


In the postexperiment interviews, most of the pilots felt they did best
with the all-white model, and they felt that the top-bottom coded model
was probably most difficult. Their comments about the orientation and
training were all favorable, and indicated that a little less training
might have been adequate. The response apparatus was judged convenient
to use and free from ambiguity. The length of the test sessions
appeared to produce no fatigue or feelings of unreasonable demands.

Comment on the realism of the problems pointed up the fact that seeing
a bright aircraft against a dark sky (as if a distant spotlight were
illuminating only the intruder) would be somewhat rare in nature.
Nevertheless, the pilots stated that the appearance of the problems was
sufficiently realistic to permit their use in this phase of collision
avoidance research.

In reaching their collision judgment de- 
cisions, all pilots reported that their basic 
method was to look for relative motion of the 
intruder image. If the image appeared to have 
any perceptible movement, the chances were 
good that a miss would result. Since the cock- 
pit was perfectly steady during the problems, 
the relative motion cue could be (and was) utilized to great advantage.


CONCLUSIONS 
It is felt that two major conclusions may be made from the data
obtained in this investigation.

First, since the accuracy of collision judgments of visually coded
targets showed no statistically significant improvement over judgments
made with an uncoded target, sector coding of the type used here does
not appear to be a substantial aid to the pilot who must make a visual
evaluation of midair collision threats.

Second, the consistent reports of the pilots in the postexperiment
interviews indicate that their fundamental visual cue for making
collision-or-miss decisions is perceived relative motion of the target
image.

The investigators believe both findings have important implications in
determining the amount and kinds of information a pilot requires to
avoid midair collisions in VFR operations.

SUMMARY 


An F-100 flight simulator and an F-151 aerial gunnery trainer with a
black-and-white, closed circuit television system were used to present
standardized visual flight problems involving a variety of relative
course situations and altitude changes, resulting sometimes in
“collisions” and sometimes in one of a variety of “near-collisions.”

Six experienced pilots completed a total of 144 training and practice
problems and 288 test problems. Objective records taken during each
problem permitted quantitative measurement of the accuracy and speed
with which the pilots were able to identify the outcome as collision or
miss.





Four target images, all generated from miniature B-47 models, were
presented in counterbalanced test session order and in random sequences
of problems. One model was painted all white, the other three were
treated with dark and light paint to identify pairs of the cardinal
aspects of any aircraft: top versus bottom, front versus back, and left
versus right.

The objective of the investigation was to determine whether or not
these visual codings would improve pilot ability to judge the outcomes
of the simulated flight situations.

Statistical analysis was accomplished in terms of descriptive values
for accuracy and speed, and of inferences drawn from analysis of
variance. No statistically significant differences relevant to the
objectives of the study were obtained between the two-tone coded models
and the all-white uncoded model. The investigators believe it is
appropriate to conclude that visual coding of the type studied is not
of significant aid to a pilot in visually determining the existence of
a collision threat or a safe passage; and, that pilots rely on
perception of relative motion more than any other factor in judging the
probability of colliding with another aircraft.


REFERENCES

[...] collision [...] International Civil Aviation Organization,
Working Paper AIR C-WP [...] Airworthiness [...]





Journal of Applied Psychology
1961, Vol. 45, No. 6, 364-368


VARIABILITY OF STROKE WIDTH WITHIN DIGITS


CHARLES L. HUGHES


International Business Machines, New York


Typically, studies of stroke width of numerals have used a uniform
stroke width overall. Chapanis, Garner, and Morgan (1949) suggested
that this may not be the optimum condition, and they outlined some
further research needed in which the various components of the digits
are emphasized by varied stroke width within the numeral. This offered
a new dimension in digit design. Studies directly relating to the
present problem are in short supply, with the exception of an article
by Soar (1958). Starting from an analysis of research results showing
the confusion resulting from the use of conventional arabic numerals,
Soar developed new sets of arabic digits in which parts of the numerals
were distorted in form and/or stroke width to emphasize a difference
between them. The common elements were minimized and the unique
elements were emphasized. Six of the experimental digits proved to be
significantly more visible than the usual arabic digits.

The relative readability of different digits depends on the criterion
used as a legibility measure, the style or form of the numerals, and
the stroke width. The digits 0, 3, 6, and 9 are most often confused
with each other. They usually have curved rounded outlines. In most
studies the digit 7 generally has the best visibility. This gives part
of the rationale for the trend toward angles and away from curves.
“Angularity” is the term applied to the de-emphasis of roundness or
curvature of form and the use of sharper curves and straight lines in
the numeral, as in the 0, 3, 6, and 9 versus the 7.

Two earlier experiments by Soar (1955a, 1955b) served to fill in the
background leading up to the present problem. Significant interaction
between stroke width, illumination level, and figure-ground contrast
was demonstrated, with stroke width to height ratios the same overall
(constant even stroke over the entire figure). Conventional arabic
numerals were used, but modified in form somewhat to more angularity.


Additional evidence has been compiled which suggests that angular
straight-line figures are more readable, with symbolic digits receiving
increasing but contradictory attention. Symbolic digits are simplified
straight-line figures that have cognates in the arabic numerals. They
have no curved lines. Some figures that are in between in form
(distorted, angular digits) were used by Berger (1944a, 1944b).

In an often cited study Mackworth (1944) devised some new numerals and
letters that simplified the components and curvature of conventional
forms. The new angular digits were readable at significantly greater
distances than the conventional ones. Several other studies have made
comparisons of the relative visibilities of different designs, namely,
Brown, Lowery, and Willis (1951) and Lansdell (1954). McLaughlin (1948)
compared some standard sets of digits against new designs having
variability of form and stroke width within the numerals. The results
show that the form of the digits was not significant, but the
within-varying stroke width was. This is at odds with the previously
reported studies where form and even-overall stroke widths were both
significant. The results are further confused by a retest of
McLaughlin’s study (Soar, 1958) in which the digit forms were
significant. Varying stroke widths within conventional numerals
complicates the problem so that the curvature and stroke width changes
cannot be separated.

Many studies have been done on the effect of stroke width to height
ratios upon visibility. In all cases the same stroke width was used for
the entire set of digits and was not varied within any set or within
any single digit. Berger (1944a, 1944b), Brown (1953), Brown and Lowery
(1949) all obtained similar results. In addition Shapiro (1951), Loucks
(1944), Craik (1941), Kuntz and Sleight (1950), Forbes and Holmes
(1939) all obtained similar results with optimal stroke width to height
ratios falling between 1:6 and 1:8. Actually only McLaughlin (1948) and
Soar (1958) have varied stroke widths within digits. Furthermore, as we
have mentioned, they could only make a partial investigation of the
relative contributions of stroke widths and form variation to
readability of digits.

All these studies have included variations of conventional arabic
numerals. Cohen and Webb (1953) suggested symbolic numerals made from a
six-element straight-line matrix. These particular symbolic digits did
not give as good a performance as conventional numerals. The study did
result in a suggested eight-element straight-line matrix which was
tested (Alluisi & Martin, 1958; Alluisi & Muller, 1956). Performance on
these numerals was not very different from performance on conventional
numbers. The symbolic digits, however, offer a way to approach the
problem of form, within-variable stroke width, and their interaction.

The primary conclusions which may be drawn from the previous research
on digit design cited above would appear to be as follows:

1. The degrees of boldness of stroke lie within the range (SW/H
ratios) 1:6 to 1:8 for best visibility.

2. Overall width to height ratios may range 
from 1:1 to 7:10. 

3. Angularity increases visibility. 

4. Increased inclosed white space aids visi- 
bility. 

5. No adequate theory or hypothesis has 
yet been formulated to account for the differ- 
ences in visibility demonstrated in research 
studies. 
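
As an illustration of what the ratios in Conclusions 1 and 2 imply for an actual digit, the sketch below converts a digit height into the corresponding stroke widths and overall widths. The function names and the 48-unit example height are assumptions made for the illustration; exact fractions are used to avoid rounding.

```python
from fractions import Fraction


def stroke_width(height, sw_h_ratio):
    """Stroke width for a digit of the given height and SW/H ratio."""
    return height * sw_h_ratio


def overall_width_range(height, narrow=Fraction(7, 10), wide=Fraction(1, 1)):
    """Narrowest and widest overall digit width for width-to-height 7:10 to 1:1."""
    return height * narrow, height * wide


height = 48  # arbitrary units, chosen only for the example
bold = stroke_width(height, Fraction(1, 6))      # boldest stroke in the range
light = stroke_width(height, Fraction(1, 8))     # lightest stroke in the range
narrowest, widest = overall_width_range(height)

print(bold, light)                          # 8 6
print(float(narrowest), float(widest))      # 33.6 48.0
```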


Questions which remain to be answered are: 


1. What are the relative visibilities of conventional arabic numerals
and symbolic numerals?

2. Do form and stroke width interact in digit perception?

3. Do some digits require greater or lesser stroke width for best
visibility than other digits (should stroke width vary within a set of
10 digits)?

4. Do some digits require a variation of stroke width within that
particular digit, through stroke width emphasis of some part of the
digit for best visibility (should stroke width vary within a single
digit)?


PROBLEM AND METHOD 


The present study was an attempt to [...] systematically the effect of
variations in stroke width, conventional versus [...], emphasizing the
unique aspects [...] stroke boldness, angularity versus curvature,
[...] interaction of these variables.

Subjects. The subjects (Ss) were [...] enrolled in a course in
Engineering [...] with normal vision or with optical correction.

Apparatus. The stimuli were a set of digit [...] on which the digits
were drawn, one digit [...] high, and were of two stroke widths. The
stroke width to height ratios (SW/H) were either 1:6 or 1:8. These
stroke widths then were either emphasized (1:6) or unemphasized (1:8).
Eight “types” of digits, 10 of each (1 through [...]), were used for a
total of 80. Some, such as the 1 [...], were duplicates of other types
and were included to prevent imbalance in the frequency of presentation
of each form and as an offset for guessing habits on the [...]
subjects. The types were (see Figure 1):

Type A: conventional arabic, even-overall stroke width, SW/H 1:8
Type B: conventional arabic, even-overall stroke width, SW/H 1:6
Type C: symbolic, even-overall stroke width, SW/H 1:8
arabic, ven-overal 


symbolic, even-overall st: 





Type E—symbolic, 
nents, SW 
Type F 
nents, SW 
Type G 
nents, SW 
Type H 


omponents 


H 1:6 and 1:8 
symbolic 


H 1 


emphasize 


~} 


ks 


wert 


Training. T 
lata gathering 


VO wet and one 
the Ss 

; ] ] 
conventional and symboli 
the 
and the 


fully 


lus cards 
in the 


The level of 


using actual stimu 


method used 


described 


d by having a 
ly all of the 


symbolic 


rate 
digits 
tional 
of ti 

Procedu e Ss 


rabic numer 
ining 
complete trial for 
the entir: 
were reassembled 


The 


this w 


lom or 
initial stin 


as dec 


visual conditions 


peated. Onl 


than twice by 


to show an 
guess responses 
esponst After 
the 
ne same tor 
corded 
read the stimulus card co 
Statistical Analy 
vidth, and the 
analysis of 
digits, Ty 
numerals 
The 


treatment 


in te 


rms of d 


inter 
irlance 


+} 


pes ( 
and the special 
signs inalyses of al 
separate 
tests, 


one for each forn 


Duncan’s shortest significant 
which particular 
which other 


nition of 


to indicate 
differe 
Deft 


following terms 


nt from 


lefinitions giv 
the 
tive of the stroke 
and the 


Stroke 


ven ner 


Form shape or contour of 
Variation e.g 
3 has a diffe 


Width—th 


rent torm, et 


SW 


ratios of 


] 
a \ 


ld , 
pecial design with only 


ommon elements 


we 
design of 

learning 
Ss individual 


rations 


timuh 
Stimulus 


incomp 


the 


g., the 2 ha 




SW 


width 


emphasized horizontal compo 


rti 


unique 


absent, SW/H 1 


week prior the 


familiarized with both th 


digits 


training 
all Ss 


Specific 
re given to 
the digits was 
achie\ ed 


consisted 


digit irrespe 


one torn 


] 




rABLE 1 
EFFECTS OF For 
WIDTH 


Vi 


STROKE 


9900 110.00 
10.00 
10.36 
378.00 
7.46 
$55 


3.59 


eg t 
ized, nonbold stroke 
the total 


Variations 


bold strok 
Configuration 
SW 


“x, the 


n and 


elements the diag« n 


not 


present 


binations 


mM 


RESULTS 


Table 1 gives the factorial analysis of variance and shows form and SW
interacting significantly ([...] level of confidence) for the symbolic
digits. The different forms did not give significantly different
visibilities when stroke width was held constant. Form changes and SW
variations together accounted for differences in perceptibility of the
configurations. In other words, increased perceptibility of digits can
be obtained by varying the SW emphasis of certain forms. Contrary to
prior studies, the symbolic digits tended to be more perceptible.

DISCUSSION 
The configuration of SW variations and form that was more perceptible
varied greatly. No regular patterns appeared. In short, each digit
configuration is a strictly unique event. The part of a digit requiring
emphasis depends on that digit.

A few other clues can be pointed out. The factor “openness” suggested
by other experiments is the same for all symbolic types, so its
validity as a perception factor is questionable. The results
demonstrate that varying SW within a digit may increase visibility. In
most cases the symbolic configurations were more perceptible than
either of the conventional types. Comparing some of the digits, the
variation and lack of consistency from form to form with best SW [...]
the configurations is demonstrated. The differences in configuration in
arabic and symbolic 7’s are small; however, there were differences in
the visibility. This raises a question on the use of anything other
than straight lines for digit construction. The differences in the
narrow overall stroke and wider overall stroke configurations give some
evidence that for some digit forms a narrower ratio is better and for
others a wider stroke gives greater visibility. This would relate to
the interaction effects demonstrated. Simplicity of form shows up as an
asset to perception in several cases. For the [...] all the symbolic
types were significantly more visible than the two conventional [...].
The only difference in the conventional arabic and symbolic [...] is
that the former is circular and the latter [...]. Compare this with the
curved versus [...] results noted above.

Consider the problem of interaction of stroke width and the shape of
the digit. The data show that the same shaped digit has different
levels of visibility depending upon the particular emphasis of SW made.
The determination of interaction effects of these two variables shows
they do significantly affect the legibility of the symbolic digits.
Whereas in previous studies stroke width was varied the same overall
each figure, and overall each set of 10 digits as well, we also allowed
for a variation within each set, and within each numeral.

The results are at odds with previous studies. Previously stroke width
(being the same overall each digit in other research designs) was
secondary in effect to form when certain overall SW/H ratios were
observed. The present studies show that varying the stroke width within
each digit can increase visibility, and that form alone is not the
determiner. Interaction of form and stroke width has a significant
influence on the perceptibility also. The contributions of form, stroke
width, and the interaction take no regular pattern.


The significant interaction of form and SW makes an interpretation of
the effects of either difficult. Differences in form must have some
influence on perceptibility, however. The interaction of form and SW
presumes some significance of form. Some types of a particular form
were more significantly visible than other types. An inspection of the
results appears to show that the stroke width effects have more
influence on perception than the changes in form. This interaction
brings many prior studies into a different light, since previously only
one or the other (form or SW) was varied, not both. Established optimal
SW/H ratios are possibly in need of a recheck.
Other studies typically determined 

stroke width to height ratios for 

ot | 


were 


in the t 
digit 
that 
creased in visibility by varving 
of the 


digits, but presel 


varied from to digit 


some digi 


indicate, however 


stroke width within individ 


representing a major departure 


methodological and theoretical 


study show that 


mcrease nNume 


SUMMARY ANI 


con ple X1t\ 


One area of 


operators 


tors rns methods 


conce 


to the operator 


} 
rm. One small but 
prot lem is desig 


the visual displays 
The results of previous research on digit design were summarized as
follows: The optimum stroke width to height ratios are apparently
determined, angularity of shape increases visibility, and the openness
of digits aids the discrimination. Certain questions about digit design
were approached in this study: What are the relative visibilities of
conventional and symbolic digits? Do form and stroke width interact?
And, most importantly, will varying the stroke width within a set of
digits and also within any particular digit increase the
perceptibility?

The problem was to determine the effects of the above factors through a
systematic variation of the stroke within a set of symbolic digits
(straight-line, angular forms constructed from an eight-element matrix
having cognates in the conventional digits).

The S-to-stimulus distance was varied by decreasing intervals until the
S could correctly identify the digit. Eight types of digit
configurations plus conventional arabic numerals were used. The 80
digits were presented, one at a time, in random order to the Ss. Data
were recorded in terms of the maximum distance at which the S could
correctly identify the digit.

The results indicated that the symbolic numerals were significantly
more visible than the conventional digits, contrary to prior reports.
Form and stroke width showed a significant interaction, supporting the
conclusion that stroke width could vary within a set of digits or even
within a single digit so as to increase the discriminability.

We can generalize to say that each digit must be designed on an
individual configuration basis taking form, stroke widths, and their
combinations into account. Two factors must be considered: (a) absolute
recognition of the digit as such, (b) differentiation (by relative
emphasis) of features that suggest similarity (hence ambiguity) of one
digit to each of the others.

Questions still unanswered are: what are the effects of decreasing
illumination to near thresholds, tachistoscopic presentation, and the
relative perceptibility of groups of the symbolic digits?
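
The distance measure described in the summary (the stimulus is brought closer until the S first identifies the digit, and that distance is recorded) can be sketched as a simple descending series. This is a sketch under assumptions: the fixed step size, the function name, and the `can_identify` callback standing in for the subject's response are all hypothetical, since the source does not give the actual interval schedule.

```python
def max_recognition_distance(start, step, can_identify, minimum=0):
    """Return the greatest distance at which the digit is correctly identified.

    start        -- initial S-to-stimulus distance
    step         -- amount by which the distance is reduced on each trial
                    (a fixed step is an assumption made for this sketch)
    can_identify -- callable taking a distance and returning True when the
                    digit is read correctly (stands in for the subject)
    minimum      -- smallest distance tried before giving up
    """
    distance = start
    while distance >= minimum:
        if can_identify(distance):
            return distance       # first success in a descending series
        distance -= step
    return None                   # never identified within the range tried


# Hypothetical subject who can read the digit at 22 units or closer.
print(max_recognition_distance(40, 5, lambda d: d <= 22))  # 20
```

Because the series descends in steps, the recorded value (20) is the first tested distance at or below the subject's true limit (22), so a smaller step gives a finer estimate.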


REFERENCES 
Artus!, E. A., 


analysis of 


& Martin, H. B. An informational 
verbal and motor 
bolic and conventional arabic 
Psychol., 1958, 42, 79-84 
Attus!, E. A., & Mutter, P. F 
transfer with seven symbolic 
and verbal responses. USAF 
1956, No. 56-226 
SERGER, C 
ing of numerals as determinants of the threshold 
of recognition, Part I. J. appl. Psychol., 1944, 28, 


208-231. (a) 


responses to sym 


numerals. J. appl 


Rate of information 
Motor 
Rep., 


visual codes 


WADC tech 


Stroke-width, form and horizontal spac- 


Charles L. Hughes 


Bercer, C. Stroke-width, form and horizontal spac 
ing of numerals as determinants of the threshold 
of recognition. Part I. J. appl. Psychol., 1944, 28 
336-346. (b) 

Brown, F. R. A study of the requirements for let- 
ters, numbers, and markings to be used on trans- 
illuminated aircraft panels: IV. Legibility of uni- 
form stroke capital letters as determined by size 
and height to width ratio and as compared to 
garamond bold. USN Air Material Cent. Rep., 
1953, No. TED NAM EL-609, Part 4 

Brown, F. R., & Lowery, E. A. A study of the re 
quirements for letters, numbers and markings to 
be used on trans-illuminated aircraft control pan 
els: I. The effect of stroke width upon the legibil 
ity of capital letters. USN Air Material Cent. Rep., 
1949, No. TED NAM EL-609, Part 1 

Brown, F. R., Lowery, E. A., & Wiiiis, M. P. A 
study of the requirements for letters, numbers, and 
markings to be used on transilluminated aircraft 
control panels: III. The effect of stroke width and 
form upon the legibility of numerals. USN Air 
Vaterial Cent. Rep., 1951, No. TED NAM EL-609, 
Part 3 

Cuapanis, A.,, 
Applied experimental 
Wiley, 1949 

Conen, J., & Wess, I. B 
of numerals for tape presentation 
tech. Rep., 1953, No. 54-86 

Craik, K. J. W. Instrument lighting for night use 
fir Ministry Flying Personnel Res 
Rep., Lond., 1941, No. 342 

Forses, T. W., & Hortmes, R. S 
of highway destination signs in relation to letter 
height, letter width, and reflectorization. Proc 
Highway Res. Bd., 1939, 19, 321-335 

Kuntz, J. E., & Stercut, R. B. Legibility of numer 
als: The optimal ratio of height to width of stroke 
imer. J. Psychol., 1950, 63, 567-575 

LanspeLtt, H. Effects of form on the legibility of 
numbers. Canad. J. Psychol., 1954, 8, 77-79 

Lovexs, R. B 


dials 


Garner, W. R., 
psve hology 


& Morcanx, C. T 
New York 


An experiment on coding 


USAF WAD( 


Committee 


Legibility distances 


instrument 

\ further investigation of the relative legi 
bility of tachometer dials. USAF Sch Ved 
proj. Rep., 1944, No. 1. (Project No 

MackwortH, N. H. Legibility of raid block letters 
and numbers. Air Ministry Flying Personnel Re 
Committee Rep., Cambridge U., 1944, No. 423(s) 

McLavcuuiry, S. C. Configuration and stroke-width 
in numeral legibility. Unpublished master’s thesis, 
Tufts College, 1948 

Suaptro, H. B. Factors 
digits. Amer. Psychologist, 

Soar, R. S. Height-width 
width in numeral visibility 
39, 43-46. (a) 

Soar, R. S. Stroke width, illumination level, 
ure-ground contrast in numeral visibility 
Psychol., 1955, 39, 429-432. (b) 

Soar, R. S. Numeral form as a variable in numeral 
visibility. J. appl. Psvchol., 1958, 42, 158-16 


Legibility of aircraft 
{viat 


286) 


legibility of 
1951, 6, 364. (Abstract) 
proportion and_ stroke 


J appl Psychol., 1955, 


affecting the 


and fig 


a appl 


(Early publication received June 16, 1961) 





Journal of Applied Psychology 
1961, Vol. 45, No. 6, 369-375 


THE INTERACTION OF INFORMATION DISPLAYS 
WITH CONTROL SYSTEM DYNAMICS IN 
CONTINUOUS TRACKING ¹ 


R. W. OBERMAYER, W. F. SWARTZ, AND F. A. MUCKLER 


Martin Company 


Perhaps the most widely used display modes in human tracking studies
have been following and compensatory information displays. In the
compensatory display mode, the operator attempts to align a moving index
with a fixed command index. In this case, the moving index is the sum of
the operator's control movements, control system dynamics, and some
arbitrary forcing function. In the following display technique, on the
other hand, the forcing function is programed directly to the command
index providing a moving profile of desired performance. The second
moving index then becomes directly the resultant of the operator's
control output. In both cases, of course, the objective is to null the
error between the two display indices. Experimental comparisons of
following and compensatory displays have been predominantly in favor of
the following display (e.g., Poulton, 1950; Senders & Cruzen, 1952),
presumably due to the additional information available to the operator.
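The difference between the two display modes can be made concrete with a small sketch. It is an illustration only, not the authors' apparatus logic: the function names are hypothetical, and the compensatory moving index is taken as the difference between the course and the control system output, as the Method section describes.

```python
def compensatory_display(course, system_output):
    """Compensatory mode: the command index is fixed and the moving
    index shows only the error (course minus system output)."""
    command_index = 0.0  # fixed reference at screen center
    moving_index = course - system_output
    return command_index, moving_index


def following_display(course, system_output):
    """Following mode: the command index traces the course and the
    second index shows the control system output directly."""
    return course, system_output


def index_separation(command_index, moving_index):
    """Distance the operator must null in either mode."""
    return abs(command_index - moving_index)


if __name__ == "__main__":
    course, output = 1.5, 0.5  # hypothetical values, in inches
    for display in (compensatory_display, following_display):
        command, moving = display(course, output)
        print(display.__name__, command, moving,
              index_separation(command, moving))
```

Both modes present the same separation to be nulled; only the following mode shows the course and the operator's output as separate quantities.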

However, since stimulus and response are clearly interdependent in a
man-machine feedback control system, it is not surprising to find
instances where the effects on tracking performance of type of
information display and control system dynamics interact (e.g.,
Chernikoff & Taylor, 1957). Theoretically, one might expect that under
certain dynamic conditions the advantages of the following display might
disappear. If a control system includes integrators, the system output
may continuously change without a corresponding continuous change of the
operator's control force. Relieving the operator of the necessity of
applying control force continuously, or in some time-sequenced pattern,
has been termed "unburdening" by Birmingham and Taylor (1954). Based on
the assumption that the human operator will perform best when his
function is simplest, unburdening may result in a reduction of
complexity in both display and control requirements. The compensatory
display could be superior if the information contained in the
instantaneous amplitude of the displayed error is sufficient to specify
the response which reduces the error to zero. As Chernikoff and Taylor
(1957) state: "The S would not have to take account of target velocities
or error velocities nor would he have to predict in any fashion." Their
data clearly suggest that the frequency of the course forcing function
and variations of control system dynamics markedly affect the efficiency
of compensatory and following tracking.

¹ This work was supported in part by the United States Air Force under
Contract No. AF 33(616)-5472, monitored jointly by the Aerospace Medical
Laboratory and the Flight Control Laboratory, Directorate of Advanced
Systems Technology, Wright Air Development Division, Wright-Patterson
Air Force Base, Ohio, as a part of the Air Force Control-Display
Integration Program.

To provide additional clarification, this study was undertaken to
investigate the interaction between modes of information display and
control system dynamics with a low frequency simple sine wave. Both
compensatory and pursuit modes of information display are compared where
the operator is provided with direct control of successively higher
derivatives of the control system output.


METHOD 

Apparatus 

Display. The display elements were a horizontal line and an inverted T
presented on a 5-inch Dumont Dual Beam CRT. The horizontal line was the
command index and was 2 inches long. The inverted T was the
subject-controlled index. It consisted of a horizontal line 1 inch long
with a vertical line, .5 inch long, bisecting the horizontal component.
Both symbols were centered on the CRT and moved only in the vertical
dimension.

When using the following display mode, the inverted T was controlled by
the subject and the horizontal line was positioned by a course
generator. The compensatory mode consisted of the horizontal line being
used as a reference on the center of the screen while the inverted T was
driven by the difference between the course generator output and the
control system output.

Control. A center-positioned joy stick was used. It was 1 inch in
diameter and 22 inches long. It was possible to move the top of the
[...] stick [...] arc of plus or minus 6 inches from the stick's [...]
position by forward or rearward movements, respectively. The control
stick was spring loaded so that it would return to within [...] of an
inch [...] null position. A breakout force [...] 5 pound was required to
displace the stick from [...] position. The force-displacement gradient
was linear and was 1 pound per inch. An analog deadband circuit was
incorporated into the experimental apparatus to correspond to plus or
[...] of an inch of stick displacement.

[...] proportional position, [...] acceleration [...] controlled index,
i.e., [...] dynamics, [...] integrator, and two integrators [...] stick
movement resulted in [...] inches [...] second, or [...] per second
[...] index displacement for position, rate, and acceleration [...]
resulted in the same control index displacement [...] second.

[...] experiment [...] maximum amplitude of 2 [...] deflection from zero
[...]

Scoring Measures 

1. The Average Absolute Error (AAE) [...] error, was the average
difference in inches [...] command and controlled index without [...]

[...] tolerance band of plus or minus [...] the command [...]

[...] between the command index and controlled index [...] of an inch.

[...] Average Absolute Control Movement (AACM) [...] average distance in
inches along the control stick handle that the control stick [...] than
plus or minus [...]

6. The Control-Stick Count estimated the number of times that the
control stick was moved through the stick deadband of plus or minus
[...] of an inch.

Subjects 

Nine adult males served as subjects in the experiment. They were
selected from a broad population of professional pilots and engineers.
In addition, [...] graduates and varied between [...]

[...] treatment × treatment × subjects model of analysis of variance was
used (Lindquist, 1953, p. [...]37). There were six treatment
combinations as a result of two levels of display modes and three levels
of control dynamics. This design was used to increase the precision of
the experiment by partitioning the variance due to intersubject
differences. An interesting by-product of this design was the possible
examination of the interaction of subject performance levels with
treatments.

Procedure 

Each subject was placed in a completely enclosed mockup of a
fighter-type cockpit facing the CRT display which was approximately 28
inches from the subject's eyes. A low intensity white light was placed
inside the mockup to provide ambient illumination. The brightness of the
light, fixed throughout the experiment, was [...] illuminate the panel.

An instruction and practice period was held prior to administration of
experimental treatments for each subject. The subjects were thoroughly
briefed [...] following and compensatory [...] and position, rate, and
[...] system dynamics. The practice [...] each treatment combination
[...] of modes and dynamics [...] two trials [...] replicated with
randomly presented combinations of modes and [...] There were [...] rest
period [...] in each treatment combination [...] each subject [...]
trial was [...] in length with [...] second rest period [...] light on
the panel was automatically lighted to warn the subject that 10 seconds
remained before the beginning of the next trial. During the 3[...]
second rest period, trial scores for each of [...] dependent variables
were obtained at the experimenter's console. The overall time for the
subject to complete the task was approximately 2[...] hours.


RESULTS 

Performance Scores 

A total of 540 performance scores were obtained for each measure. Ten
trials of each treatment combination were presented to the subject;
however, only the last five trials of each combination were retained for
examination in order that stable data might be obtained from each
subject. Retaining only half the total observations resulted in each
measure being based upon 270 scores.
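The score counts quoted above follow directly from the design (nine subjects, two display modes, three dynamics, ten trials per combination with the last five retained). The short check below is only an arithmetic sketch; the variable names are illustrative.

```python
SUBJECTS = 9
DISPLAY_MODES = 2      # following, compensatory
DYNAMICS = 3           # position, rate, acceleration
TRIALS_PRESENTED = 10  # per treatment combination
TRIALS_RETAINED = 5    # last five trials only

treatment_combinations = DISPLAY_MODES * DYNAMICS
scores_presented = SUBJECTS * treatment_combinations * TRIALS_PRESENTED
scores_retained = SUBJECTS * treatment_combinations * TRIALS_RETAINED
trials_per_treatment_mean = SUBJECTS * TRIALS_RETAINED

if __name__ == "__main__":
    print(treatment_combinations)     # six treatment combinations
    print(scores_presented)           # scores obtained per measure
    print(scores_retained)            # scores retained per measure
    print(trials_per_treatment_mean)  # trials behind each plotted mean
```

The last quantity corresponds to the 45 trials on which each treatment mean in Figure 1 is based.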

Figure 1 presents the mean tracking performance data for the AAE and RMS
measures; each mean was based upon 45 trials. The treatments are ranked
along the abscissa of the figure in ascending order of treatment mean
error magnitude from left to right. In Figure 1, a decrement of
performance is noted for each change of control system dynamics to a
higher derivative. Further, performance with following mode ranked
better than compensatory mode for position control in both of the
measures. However, this relation between following and compensatory was
reversed for the rate and acceleration dynamics.

Figure 2 presents the data for the TOT and Hits measures. TOT decreased
in value as the AAE and RMS increased in value. An examination of the
Hits measure reveals that there were fewer tolerance band crossings for
the rate control than for either the position or acceleration control.
It becomes apparent, then, upon examining the TOT values, that the
subjects tended to be on target for longer periods per time for the rate
control than for position control even though better TOT performance was
obtained for the position control. When compared to rate control, a
relatively higher number of hits for the acceleration control [...] did
not [...] however.

The Control-Stick Count measured the number of times that the control
stick was moved through the null position [...] shown in Figure [...]
control is increased [...] creases slightly, then [...] with
acceleration control [...] appear from these data that [...] response
was not in the [...] positioning movements, [...] Control-Stick Count
[...] for [...] derivative of control dynamics [...] Also, while it can
be [...] subjects could achieve zero [...] a sinusoidal response [...]
case since otherwise [...] Control-Stick Count would [...] specific
measure of the [...] response was taken [...] with acceleration dynamics
[...] many subjects took the [...] frequency, low amplitude [...]
(termed "dither") super[...] frequency, high amplitude signal.

The AACM, as shown [...] creased as a direct function [...] derivative
of control system [...] considerably less control stick [...] with rate
and acceleration dynamics [...] position control dynamics [...] little
difference in AACM [...] of information display.
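The error measures discussed in this section can be illustrated with a short sketch. The exact scoring definitions are only partly legible in the Method section, so the functions below use the conventional definitions of average absolute error, root-mean-square error, and time on target; the tolerance band and sampling interval are hypothetical parameters, not values taken from the study.

```python
import math


def average_absolute_error(errors):
    """Mean of the absolute error samples (inches)."""
    return sum(abs(e) for e in errors) / len(errors)


def rms_error(errors):
    """Root-mean-square of the error samples (inches)."""
    return math.sqrt(sum(e * e for e in errors) / len(errors))


def time_on_target(errors, tolerance, dt):
    """Seconds during which the error stayed inside +/- tolerance.
    Both `tolerance` and `dt` are assumed values for illustration."""
    return dt * sum(1 for e in errors if abs(e) <= tolerance)


if __name__ == "__main__":
    sampled_error = [0.0, 0.1, -0.3, 0.2]  # hypothetical error samples
    print(average_absolute_error(sampled_error))
    print(rms_error(sampled_error))
    print(time_on_target(sampled_error, tolerance=0.15, dt=0.5))
```

Because TOT counts time inside the band while AAE and RMS grow with error magnitude, TOT falls as the two error scores rise, which is the pattern described for Figure 2.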





Fig. 3. Mean tracking performance data of each treatment for the AACM
and Control-Stick Count measures.


Statistical Analysis 

Analysis of Variance Results. The measures of AAE, RMS, and TOT were
statistically examined by means of the treatment × treatment × subjects
model of the analysis of variance (Lindquist, 1953). Prior to analysis,
the data for each of the three measures had to be transformed to satisfy
the assumptions of the analysis of variance. The appropriate transform
for each measure was determined on the basis of the normality plot for
the respective distribution. It was determined that the AAE and RMS
measures should be transformed logarithmically (log AAE and log RMS) and
that the TOT measure should receive a tangent transformation, tan 1.5
(TOT - 10). Upon completion of the transformations, homogeneity of
variance for each measure was examined (Odeh & Olds, 1959). For each Y
used in the analysis of variance there is associated an s². Using the
procedure established by Odeh and Olds, an identical analysis of
variance was conducted on the quantity log s². By using this method, the
distribution of each source of variance was analyzed separately with
respect to heterogeneity. The results of this analysis are presented in
Table 1 in conjunction with the summary of the analysis of variance
results.


TABLE 1 
SUMMARY OF THE ANALYSIS OF VARIANCE RESULTS FOR THE MEASURES OF AAE, RMS, AND TOT 
(Significance level by measure; sources: Modes, Dynamics, Subjects, M × D, M × S [...]) 


TABLE 2 
RESULTS OF THE MULTIPLE RANGE TEST FOR THE M × D INTERACTION OF THE AAE, RMS, AND TOT MEASURES 
(Range comparisons by measure: AAE, RMS, TOT [...]) 
Note. The underline denotes no significant difference at the 5% level. 


In Table 1, the most interesting results are displayed in the Modes ×
Dynamics interaction. It is noted that the significant interaction,
whose distribution was homogeneous at the 5% level, has been obtained in
each of three measures. A significant Dynamics × Subjects interaction
was found, also, for all measures, but the Modes × Subjects interaction
was significant for only the TOT measure. Unfortunately, it cannot be
determined whether the relation between display modes and control
dynamics was also dependent upon subject characteristics since the
M × D × S interaction was nontestable.
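The logarithmic transformation and the log s² quantity described under Analysis of Variance Results can be sketched for a single subject-by-treatment cell. This is a minimal illustration under stated assumptions: the base of the logarithm is not given in the text (base 10 is assumed here), the helper name `cell_summary` is hypothetical, and the tangent transform applied to TOT is omitted because its units cannot be recovered from the text.

```python
import math
from statistics import mean, variance


def cell_summary(scores):
    """For the retained trials of one cell, return (Y, log s^2):
    Y is the mean of the log-transformed scores and s^2 is their
    sample variance. Base-10 logarithms are an assumption."""
    logged = [math.log10(x) for x in scores]
    y = mean(logged)
    log_s2 = math.log10(variance(logged))
    return y, log_s2


if __name__ == "__main__":
    # Hypothetical error scores chosen so the arithmetic is easy to follow.
    y, log_s2 = cell_summary([1.0, 10.0, 100.0])
    print(y, log_s2)
```

Running the same analysis of variance on the log s² values, cell by cell, is what allows each source of variance to be examined separately for heterogeneity.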

Multiple Group Comparisons. Table 2 presents the results of Duncan's new
multiple range test (Duncan, 1955) of the M × D interaction in summary
form for each of the measures. All multiple comparisons of the treatment
means were conducted at the 5% level. An underline denotes no
significant difference between those treatment means at the 5% level;
all other comparisons are significant. In addition, the treatments are
ranked, by the magnitude of their means, in ascending order of error
from left to right. It is noted that there was no difference in the
statistical outcomes in the RMS and TOT measures and very little between
the AAE and RMS-TOT measures. The treatment ranking was the same for all
three measures.

Examining the following and compensatory 
modes for common conditions of position, 
rate, and acceleration dynamics in Table 2 
gives statistical bearing to the data contained 
in Figure 1. Compensatory was nonsignifi- 
cantly different from following mode for po- 
sition and rate control, but was significantly 
superior to following for acceleration control 
in each of the three measures. 

Examining position, rate, and acceleration 
dynamics for common conditions of follow- 
ing and compensatory modes discloses that 
position, rate, and acceleration were each sig- 
nificantly different from one another for the 
following mode with position control being 
the best and acceleration the worst; however, 
only the acceleration control was significantly 
different from the position control for the 
compensatory mode. It is noted that these re- 
sults, too, were the same for all three measures. 

Individual Subject Comparisons. Table 3 shows the relationship of
position, rate, and acceleration dynamics for each subject within the
significant D × S interaction. Mean performance scores, when averaged
over all subjects, indicated that a decrement in performance occurred
with changes from position to rate to acceleration control for each
measure. This is demonstrated in Figure 1. However, Table 3 demonstrates
that performance for each individual subject was not affected, in all
cases, in the manner which is indicated by averaging over all subjects.
The same results were not obtained for each of the measures.


TABLE 3 
COMPARISON OF POSITION, RATE, AND ACCELERATION DYNAMICS FOR EACH SUBJECT 

DISCUSSION 


It is interesting in light of the abundance 
of research proclaiming the superiority of the 
following mode of information display by a 
factor of 2 to 1 or more, that in no condition 
of this experiment was the following display 
found to be superior to the compensatory dis- 
play. Both minimum and maximum tracking 
error in this study occurred with the follow- 
ing display. This is taken as further evidence 
that with low frequency courses the subject 
needs little more than simple error informa- 
tion provided by the compensatory display. 
Indeed, it is perhaps surprising that the re- 
sults of this study were not more favorable to 
the compensatory display, since it is suggested 
by Birmingham and Taylor (1954) that in 
cases where a simple stimulus-response rela- 
tionship exists, the additional information 
provided by the following display may con- 
tribute to degraded system performance. 

Using a complex course at about the same 
frequency as the simple sine wave in the 
current study (2-2/3 + 4-4/9 + 6-2/3 cpm), 
Chernikoff and Taylor (1957) found rate 
control to be superior to position control with 
both modes of information display. In the 
current study no reliable difference between 
position and rate dynamics was found with 
the compensatory display, and rate control 
was significantly inferior with the following 
display. One would expect with a regular and highly predictable sine
wave course that the benefits of unburdening would be more prominent
than with a complex course. A possibly critical difference between the
two studies was that the average course velocity in the current study
was higher (Bowen & Chernikoff, 1958). It would appear that there is
reason to suspect that course variables exist, in addition to course
frequency, which interact with modes of information display and control
system dynamics.

It can be computed that if the output of each subject were sinusoidal
and of the same frequency as the course, zero tracking error could
result for the course used in this study. Unfortunately, there were no
data collected which would directly specify the form of the subject's
response; however, it can be inferred that the subject's response was
not totally in the form of simple corrective movements and since
tracking error was quite small, it may be assumed that a sinusoidal
response was closely approximated. While the form of the desired subject
response is sinusoidal for each order of derivative control, the phase
relationship between the desired control movement and the sinusoidal
course movement differs with control system dynamics. For acceleration
control dynamics, the desired subject's control movements are of the
same sinusoidal form but in exactly the opposite direction from that
which would achieve zero error with position control dynamics (Ely,
Bowen, & Orlansky, 1957, p. 15). The "natural" control sensing was used
for position control dynamics; therefore the control sensing for
acceleration control dynamics may be termed "unnatural" sensing. Green,
Norris, and Spragg (1955) suggest that due to the fact that in
compensatory tracking the resulting movement of the target is the result
both of the subject's input and the course generating system, the
subject does not get a clear feedback of the results of his own inputs,
and therefore, these responses may be less subject to influence by
changing the display-control relationships. The following display does
provide separate display of the results of the subject's responses;
presumably, the subject would be well informed of the effect of his
responses, and, in addition, could compare his stick motion to the
motion of the course symbol. It is, therefore, possible that the
superiority of the compensatory display with acceleration control
dynamics may be in part due to the existence of an unnatural
control-display relationship with these dynamics.
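The phase argument above can be checked numerically. The sketch below assumes idealized dynamics (a pure gain, one integrator, or two integrators) and computes the stick input that would reproduce a sinusoidal course exactly. The gains follow the values quoted in the Discussion (2 inches of stick giving 1 inch, 2 inches per second, or 8 inches per second squared); the course amplitude and frequency are hypothetical, since the study's course parameters are not fully legible.

```python
import math

# Output per inch of stick, from the gain-dynamics combinations quoted in the text.
GAIN = {"position": 0.5, "rate": 1.0, "acceleration": 4.0}


def required_stick(dynamics, t, amplitude, omega):
    """Stick deflection giving zero error for the course
    y(t) = amplitude * sin(omega * t), assuming ideal dynamics."""
    if dynamics == "position":      # y = k * u
        return amplitude * math.sin(omega * t) / GAIN[dynamics]
    if dynamics == "rate":          # dy/dt = k * u
        return amplitude * omega * math.cos(omega * t) / GAIN[dynamics]
    if dynamics == "acceleration":  # d2y/dt2 = k * u
        return -amplitude * omega ** 2 * math.sin(omega * t) / GAIN[dynamics]
    raise ValueError(f"unknown dynamics: {dynamics}")


if __name__ == "__main__":
    amplitude, omega = 2.0, 0.5  # hypothetical course parameters
    for t in (0.5, 1.0, 2.0):
        print(t, [round(required_stick(d, t, amplitude, omega), 4)
                  for d in ("position", "rate", "acceleration")])
```

At every instant the acceleration-control input is a negative multiple of the position-control input, which is the "same sinusoidal form but in exactly the opposite direction" relationship; the rate-control input leads the course by a quarter cycle.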
Since there are few identifying characteristics of the subjects of this
experiment, it is difficult to attempt to correlate the different trends
in performance found among individuals to specific subject
characteristics. However, that the results of tracking studies may be
dependent on the particular individuals employed is not a new finding.
Walston and Warren (1954) in attempting to measure a mathematical
transfer function for each subject found that quite different tracking
techniques were used by some individuals. Muckler (1960), for example,
in investigating tracking performance with oscillatory transients,
showed that trends in the performance data depended on the individual
subjects and that even reversals in the overall trend may result. He
concluded that the performance scores obtained were quite dependent on
the control techniques used by the individual subjects and that the task
techniques must be considered in the interpretation of the data. In the
present study, three different rankings of dynamics were found among
nine subjects, constituting six different statistical outcomes. It is
clear, then, that the data of this study might have been changed
considerably by judicious sampling of the nine subjects.

A control system gain was arbitrarily chosen for each condition of
control system dynamics used in this study. While 2 inches of control
movement produced display changes of 1 inch, 2 inches per second, or 8
inches per second squared for position, rate, and acceleration dynamics,
respectively, a wide range of these parameters might have been equally
acceptable for the purposes of this study. The conditions of dynamics
used, then, are best considered as control gain-dynamics combinations.
There is no guarantee that the control gain-dynamics used were optimum;
indeed, the optimum is likely a function of display mode. It is entirely
possible, therefore, that the results of this experiment are not
representative of all control gain-dynamics combinations. It is
suspected that the considerable reduction in AACM scores represents, in
part, an attempt by the subjects to achieve system stability by reducing
their overall gain; however, since the amount of control movement is
also dependent upon the control gain associated with each control
dynamic, little of a concrete nature can be stated here. The unavoidable
conclusion is that only additional research can resolve the influence of
control gain in this study.
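The gain-dynamics combinations quoted above can be illustrated by simulating the display response to a stick held at a constant 2-inch deflection. This is a rough sketch using simple Euler integration, not a model of the actual analog equipment; the step size and the one-second duration are arbitrary choices.

```python
# Display change per inch of stick, taken from the figures quoted in the text:
# 2 inches of stick -> 1 inch, 2 inches per second, or 8 inches per second squared.
GAINS = {"position": 0.5, "rate": 1.0, "acceleration": 4.0}


def simulate(dynamics, stick, dt):
    """Return the display index position after each stick sample."""
    k = GAINS[dynamics]
    position, velocity = 0.0, 0.0
    outputs = []
    for u in stick:
        if dynamics == "position":
            position = k * u             # output proportional to stick
        elif dynamics == "rate":
            position += k * u * dt       # one integrator
        elif dynamics == "acceleration":
            velocity += k * u * dt       # two integrators
            position += velocity * dt
        else:
            raise ValueError(f"unknown dynamics: {dynamics}")
        outputs.append(position)
    return outputs


if __name__ == "__main__":
    dt = 0.001
    held_stick = [2.0] * 1000  # 2 inches held for one second
    for name in GAINS:
        print(name, round(simulate(name, held_stick, dt)[-1], 3))
```

With position control the index settles at 1 inch, while with rate and acceleration control it keeps moving with no further stick movement, which is the sense in which integrators unburden the operator and also why the amount of control movement depends on the gain chosen for each dynamic.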


SUMMARY 


Nine skilled subjects performed single dimension tracking with all
combinations of following and compensatory modes of information display
and position, rate, and acceleration control system dynamics. The
forcing function was a slow frequency, simple sine wave course. Tracking
performance was found to be dependent upon the interaction between
display modes and control system dynamics. Compensatory tracking was
equivalent to following tracking for position and rate control, but was
significantly superior to following tracking for acceleration control.
For the compensatory mode, position and rate control tracking
performance was statistically equivalent; however, both position and
rate control were significantly superior to acceleration control. For
the following mode, all dynamics conditions were significantly different
with position control demonstrating highest and acceleration control
lowest performance levels. Examination of significant subject
interactions with treatments showed deviation in individual cases from
the averaged group data.


REFERENCES 


Birmingham, H. P., & Taylor, F. V. A human engineering approach to the
design of man-operated continuous control systems. USN Res. Lab. Rep.,
1954, No. 4333.


Bowen, J. H., & Chernikoff, R. The effects of magnification and average
course velocity on compensatory tracking. USN Res. Lab. Rep., 1958, No.
5186.

Chernikoff, R., & Taylor, F. V. Effects of course frequency and aided
time constant on pursuit and compensatory tracking. J. exp. Psychol.,
1957, 53, 285-292.


Duncan, D. B. Multiple range and multiple F tests. Biometrics, 1955, 11,
1-42.

Ely, J. H., Bowen, H. M., & Orlansky, J. Man-machine dynamics. USAF WADC
tech. Rep., 1957, No. TR 57-58

GREEN, R. F., Norris, I 
Compensatory tracking 
SAM two 
directions and planes 
cranks relatiy 
Psychol., 1955 

Lindquist, E. F. Design and analysis of experiments in psychology and
education. Boston: Houghton Mifflin, 1953.

Muckler, F 


hand pursuit 


A. Mai 


with short period osci 
USAF WADD 


sients 


Poulton, E. C 
play s in tracking 
No. APRU-15 

SENDERS, J. W., & 
formance in 
USAF WAD( 

Watston, C. |] 


analysis <« 


nathematical 
closed loop 
Res. Cer 





Journal of Applied Psychology 
1961, Vol. 45, No. 6, 6 


CROSS-VALIDITY OF PROCEDURES FOR SELECTING 
LIFE INSURANCE SALESMEN ¹ 


PETER F. MERENDA,² WALTER V. CLARKE, AND CHARLES E. HALL ³ 


Walter V. Clarke Associates 


An extensive study of personality and bio- 
graphical factors as predictors of success-fail- 
ure of life insurance agents has shown that 
both personality characteristics (Merenda & 
Clarke, 1959a) and personal history variables 
(Merenda & Clarke, 1959b) are related to 
success in selling ordinary life insurance. The 
previous findings have further demonstrated 
that the predictive efficiency of selecting life 
insurance salesmen can be enhanced by the 
combined use of both sets of predictor vari- 
ables. 

An earlier study of Clarke (1956b) using the same personality assessment
instrument employed in this study revealed evidence of discriminatory
power of the personality variables in distinguishing between high and
low producers among life insurance agents beyond 2 years after hire.
Another parallel study by Wallace, Clarke, and Dry (1956) failed to show
the same significant discriminations between first and second year
successful and unsuccessful agents.

The study presented in this paper reports on the cross-validity of the
prediction system used in the earlier validity study (Merenda & Clarke,
1959b) administered to an independent sample of life insurance agents.
The main intent of the study was to determine whether the power of
prediction found in the validity study (Merenda & Clarke, 1959b) would
hold up on cross-validation. A further purpose was to obtain data which
would help clarify the apparent discrepancies found in the earlier
studies of Clarke (1956b) and Wallace, Clarke, and Dry (1956).

¹ The major portions of this paper were presented at the Annual Meetings
of the Midwestern Psychological Association, St. Louis, Missouri, April
29, 1960.

² Now at the University of Rhode Island.

³ Now at the Rhode Island State Department of Education.


METHOD 

Subjects. Subjects in this cross-validation study were 535 male financed
life insurance agents hired by an ordinary life insurance company
between January 1, 1955 and July 31, 1956. The earlier validation sample
was comprised of 522 agents hired by this same company in the period
between September 1, 1950 and December 31, 1954.

Predictors. Three sets of predictors of success as life insurance
salesmen were employed with the validation sample. The first was a
self-concept personality assessment inventory. The particular instrument
was the Activity Vector Analysis (Clarke, 1956a) or AVA, as it is most
commonly called. The AVA was administered at the time of application for
the position of salesman. The second set was composed of five
personal-social history items contained in the application blank
completed by the salesmen prior to hire. A third predictor of age of the
agent at the time of application for the position of life insurance
salesman was used in the earlier validation study.
Criteria. Two sets of criterion standards were used. The first criterion
was a Success-Failure dichotomy determined by combining dollar
production with job status within the company at the end of a 3-year
period after hire.

A successful agent (N = 97) is one who: 
1. Meets his Training Program Allowance quotas or achieves $200,000
production in his first year and at least $300,000 in either his second
or third year; or 
2. Is advanced to a supervisory or management position within the
company; or 
3. Leaves the company to become agent, supervisor, or general agent of
another company before the end of the third year if he achieves the
production goals outlined above. 

An unsuccessful agent (N = 438) is one who: 
1. Fails to reach the production goals outlined whether or not he
remains as an agent with the company; or 
2. Has had his contract terminated by the company; or 
3. Leaves the insurance industry within a 3-year period. 

The second criterion was the new business volume (face value of
insurance policies) at the end of the first, second, and third years.

Procedure. In the validity study, discriminant analysis was applied to
the problem of providing maximum separation between the 522 agents
dichotomized on the first criterion into 108 successful and 414
unsuccessful salesmen. Separate analyses were made for the personality,
personal history, and age predictors. The discriminant functions for
each predictor set proved to be statistically significant. With respect
to the age predictor, it was found in the validity study that applicants
who were younger than 25 or more than 45 years old proved to be
substantially less successful than those who were within this age group.

The discriminant weights derived on the validity sample were applied to
the 535 subjects in the cross-validity sample. Applying the criterion
standard to the sample of 535, the group was dichotomized into 97
successful and 438 unsuccessful agents. Employing the same cut-offs
indicated in the validation study as being the optimal discriminant
scores for rejecting the greatest proportion of potentially unsuccessful
agents at the sacrifice of a minimum of potentially successful salesmen,
predictions were made as to the success-failure of subjects in the
cross-validity sample. Also analyses were made of the relationship
existing between the range of discriminant scores on the predictor sets
and agent production, studied separately for the personality and
personal history variates. These cross-validities were established
against the single criterion of paid production independently of
dichotomization on the success-failure criterion. The discrimination
results on the age predictor, however, were not upheld by the data of
the cross-validity sample. Hence this variate was removed from the
predictor set in the cross-validation procedure reported in this paper.


TABLE 1 
RELATIVE [...] FOR VALIDITY [...] 
(Number of agents failing to meet standards, AVA alone [...]; samples: Validity, Cross-validity) 


RESULTS AND DISCUSSION 


The linear discriminant scores representing the AVA profiles yielded a
mean of 12.37 and SD of 8.43 for the successful group, and a mean of
9.08 and SD of 8.55 for the unsuccessful group. For the total cross
validity sample the AVA discriminant scores ranged from -10 to +33. The
linear composite score distributions of the personal history variates
yielded a mean of 108.64 and SD of 10.56 for the successful group, and a
mean of 100.03 and SD of 9.80 for the unsuccessful group. For the total
sample, the personal history discriminant scores ranged from 70 to 134.

PREDICTIVE EFFICIENCIES OF AVA, PERSONAL HISTORY, AND AVA PLUS PH
AND CROSS-VALIDITY SAMPLES

Percentage rejected by these standards

PH alone AVA + PH

Table 1 presents the results obtained in the cross-validation of the
predictive efficiency of the predictor sets employing multiple cut-offs.
Any subject was considered to be a potential failure who failed to
achieve at least a discriminant score of zero on the AVA set and a score
of 91 on the personal history set of predictors.
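
The multiple cut-off rule can be sketched in a few lines of Python. This is only an illustration: the two critical scores (zero on the AVA set, 91 on the personal history set) are taken from the text, while the function name and the reading that failing either score marks a potential failure are assumptions made for the sketch.

```python
# Critical scores quoted in the text.
AVA_CUTOFF = 0   # minimum acceptable AVA discriminant score
PH_CUTOFF = 91   # minimum acceptable personal history discriminant score


def is_potential_failure(ava_score, ph_score):
    """Hypothetical helper: flag an applicant who falls below either cut-off."""
    return ava_score < AVA_CUTOFF or ph_score < PH_CUTOFF


# Illustrative applicants (scores invented for the example).
print(is_potential_failure(-3, 105))  # below the AVA cut-off
print(is_potential_failure(12, 88))   # below the personal history cut-off
print(is_potential_failure(0, 91))    # meets both cut-offs exactly
```

Under this reading an applicant must clear both screens to pass, which is why combining the two predictor sets rejects more agents than either set alone.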

It is noted in Table 1 that the validity of the prediction of failure of
a significantly great proportion of life insurance salesmen who were
actually unsuccessful at the end of 3 years in the validity sample is
upheld by the data of the cross-validity sample. While some shrinkage is
noted in the percentage of unsuccessful agents who failed to meet the
standards, a gain arose in the power of prediction of successful agents.
A drop of 7 percentage points (from 17% to 10%) was noted in the
proportion of successful agents who failed to meet the standards.

It is further noted in Table 1 that the ratio of unsuccessful salesmen
to successful agents was greater for the cross-validity sample than in
the validity sample. In the earlier sample, 108 of the 522 agents were
considered to be successful at the end of 3 years, whereas in the later
sample only 97 of 535 were successful in terms of the criteria of the
study. Two other significantly different features were noted: the
cross-validity sample contained more agents than the former sample who
failed to pass the personality criterion, and fewer who failed to pass
the personal history criterion.

P. F. Merenda, W.

TABLE 2

FIRST YEAR PRODUCTION YIELD OF PRODUCTIVE AGENTS
BY CATEGORIES OF THE AVA DISCRIMINANT SCORE

Median Mean

250.00 250.00

201.95 3.49
203.00 160.40
160.95 135.15
127.65 105.70

156.25
172.22
130.77
116.67

# this table are
nly those agent
1 the score range
luction in thousand

Although fewer of the agents failed to pass 
the selection screen than was the case with 
the validity sample, a total of 121 unsuccess- 
ful agents in the cross-validity sample would 
have been rejected at the sacrifice of only 10 
who proved to be 3-year successes. Over the 
3-year period these 121 unsuccessful agents 
sold a yearly average of $40,730 life insurance 
(face value). The 10 successful agents sold a 
yearly average of $340,000. The costs to the 
company in the initial recruitment, training, 
and financial support of these financed agents 
constituted a far greater investment than the 
3-year return from this group of 131 sales- 
men who would not have been hired had the 


rABLE 3 


YIELD OF PRODUCTIVE 
AVA DISCRIMINANT 


‘> YEAR PR 


TS BY CATEGORIES OF THE 


IDUCTION 


Scor! 


Media Mi t 

250 300.00 384.40 
150 5 288.89 333.00 
50- J&R. 89 287.20 
49 200.00 
~149 


263.75 


200.00 174.50 


Note Phe data of th 
vho were still employed at 
* The original discriminant scores w 
tant 10 in order to eliminate the use of 
| 


ated in the score range 
bP 


aid production in thousand 


Clarke, and C. E. Hall 


selection system been in effect at the time of 
hire. 
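
The rejection figures quoted above can be checked with simple arithmetic. The short Python sketch below uses only counts stated in the text (97 successful and 438 unsuccessful agents in the cross-validity sample; 10 and 121 of them, respectively, falling below the cut-offs); the variable and function names are illustrative.

```python
# Counts reported for the cross-validity sample.
successful_total = 97
unsuccessful_total = 438
rejected_successful = 10
rejected_unsuccessful = 121


def percent(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


rejected_total = rejected_successful + rejected_unsuccessful
pct_successful_rejected = percent(rejected_successful, successful_total)
pct_unsuccessful_rejected = percent(rejected_unsuccessful, unsuccessful_total)

print(rejected_total)
print(round(pct_successful_rejected, 1))
print(round(pct_unsuccessful_rejected, 1))
```

The share of successful agents who would have been screened out comes to roughly 10%, which agrees with the drop "from 17% to 10%" noted for the cross-validity sample, and the total of 131 matches the group of salesmen who would not have been hired.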

A further criterion in the form of paid pro- 
duction (face value of life insurance sold) 
was applied in the study. 


Personality Variables 


Tables 2—4 present the median and mean 
productions of agents who were actively en- 
gaged in selling insurance at the end of the 
first, second, and third years. In these tables, 
the sample subjects are classified according to 
AVA discriminant score categories regardless 
of classification on the success-failure criteria 
of the study. 

The data of Tables 2, 3, and 4 reveal that there is a high relationship
between scores obtained on the personality profiles and agent production
at the end of each of 3 years. Both the median and mean production
yields of agents who were actively engaged in selling life insurance at
the end of 1, 2, and 3 years, respectively, appear to vary directly with
the category of discriminant score ranges on the AVA. It is further
noted in these tables that those agents with discriminant scores of 50
or above on the AVA sold, on the average, an amount of insurance which
either closely approached or exceeded the minimum acceptable standards
of production set by the company agency department. These standards were
$200,000 at the end of the first year, and $300,000 at the end of either
the second or third year. The data further show that those agents with
AVA profiles which were closest to those possessed by the successful
category of agents in the validity study sold, on the average,
substantially more life insurance than those whose personality profiles
(as revealed by relatively low or negative discriminant scores) were
incompatible with the successful agents in the validity sample.

TABLE 4

THIRD YEAR PRODUCTION YIELD OF PRODUCTIVE AGENTS
BY CATEGORIES OF THE AVA DISCRIMINANT SCORE

Score

Median Mean

$16.67

292.86

$25.00 200.00

350.70 217.25
291.67 336.00 337.00
225.00 243.50 167.90

25.00 208.35 2


Personal History Variables 


The cross-validity of the personal history predictor set is demonstrated
by the data of Tables 5, 6, and 7. Again, as with the cross-validity
findings on the personality factors, a near perfect relationship was
found to exist between median/mean productions and the category
classification with respect to discriminant score range on the personal
history set of variates. Those with the lowest discriminant scores tended
to sell, on the average, considerably less life insurance than those
with high scores. It is further noted from the data of these tables that
the cut-off score of 91 which was indicated by the results of the
earlier study to be an optimal criterion seems to be confirmed by the
cross validity findings. Those agents with scores below this cut-off
sold, on the average, less than the criterion amount of life insurance
in each of the 3 years. Also, by the end of the third year only 7 of the
original 52 agents in this category were still employed by the company.

120 ar
110
100
Oo)

SY


SUMMARY AND CONCLUSIONS 


An independent sample of 535 ordinary life 


insurance agents was used to cross-validate a 


selection system originally validated on an 


TABLE 6 
PRODUCTION PRODUCTIVI 


History 


YIELD OF 
THE PERSONAI 
DISCRIMINANT SCORE 


SeEcOND YEAR 


AGENTS BY CATEGORIES OF 


Median* Mea 


120 and al 
110-119 
100-109 


525.00 533.34 300.46 


299 00 323.72 222.89 
301.43 224.52 
7) 54 

99.00 180.27 


283.34 


90-99 Qi 73 


earlier sample of 522 agents 
The 
personality profile measured 
Vector Analysis and (0b) a 
sonal history variables obtained in a weighted 
application blank. The 


success-failure 


company predictor se 


live per 
criteria were 


year agent determined o1 
multiple criterion basis incorporating agent 


dol 


Ssoid 


production and company status and () 
lar amount of face value of life insurance 
Tl 


multiple cut-offs, i.e 


1e predictions were basis of 


pass either of the critical scores 1 the two 


predictor sets was identified as a potential 


failure. These cut-off scores were determined 


from the discriminant analysis results from 


> ie) 


the validity sample of 522 agents 
The data of this cross-vali lity 

firm the findings obtained in the ori 

The se are 

1. The AVA is a valid predictor: 


failure among life 


dation study 
insurance agents 
2. Certain personal history me 


1: 


valid predictors among life 


insur 


PABLI 


120 ar 

110-119 
100-100 
oO- 99 







The predictive validity of one of the measures
(age) was not, however, upheld.

3. Combining AVA and personal history 
data enhances the predictive efficiency of 
these measures in determining success or fail- 
ure of life insurance agents over a sustained 
period of time. 


REFERENCES

Clarke, W. V. The construction of an industrial selection personality
test. J. Psychol., 1956, 41, 379-394. (a)

Clarke, W. V. The personality profiles of life insurance agents.
J. Psychol., 1956, 42, 295-302. (b)

Merenda, P. F., & Clarke, W. V. Activity Vector Analysis validity for
life insurance salesmen. Engng. industr. Psychol., 1959, 1, 1-11. (a)

Merenda, P. F., & Clarke, W. V. The predictive efficiency of temperament
characteristics and personal history variables in determining success of
life insurance agents. J. appl. Psychol., 1959, 43, 360-365. (b)

Wallace, S. R., Clarke, W. V., & Dry, R. J. The Activity Vector Analysis
as a selector of life insurance salesmen. Personnel Psychol., 1956, 9,
337-345.

(Received January 1, 1960)





Journal of Applied Psychology
1961, Vol. 45, No. 6, 381-38


EXPERIMENTAL EVALUATION OF BINARY CODES
FOR CONSOLE DISPLAY


FRANK J. MINOR and STANLEY L. REVESMAN

International Business Machines Corporation, Endicott, New York


This study experimentally evaluated three binary code schemas in terms
of operator coding performance. The purpose of the evaluation was to aid
in the selection of one of three alternative code schemas to be used on
a data processing system console display. Information to be displayed in
coded form would be a computer memory address displayed on an "address
register display," and the contents of that address displayed on a
"character register display." This information is required by the system
programer to check the accuracy of a series of data processing
instructions which he has assembled to serve as a data processing
program. In installations of a given type such checking, or debugging,
is accomplished by executing via the system console one program
instruction at a time in its proper sequence. During this cycling
process the programer makes careful note of the program instruction, the
raw data, the data results and their corresponding memory addresses as
displayed in coded form at the appropriate console character and address
registers. This information is checked by the programer to locate errors
of programing which if left uncorrected would result in invalid data
processing.

The planned data flow logic of the system was amenable only to the three
codes evaluated in this study, and would not permit modifications of the
codes. The code schemas were:

1. Biquinary Code
2. Binary Coded Decimal
3. Combined Code

Codes 1 and 2 are currently in use for display purposes on data
processing systems. Code 3 is a workable combination of the first two
codes, but is not currently known to be in use for display purposes.
Because of the market for which this system was designed, transfer of
training was not an issue in this study. It is not expected that
programers trained for existing stored program data processing systems
would also perform programing for this new system.

Of the three code schemas, the Binary Coded Decimal was the most
efficient in terms of cost and electronic simplicity for this specific
system. The difference in efficiency however was not sufficient to
justify eliminating the other two schemas from consideration. This study
was conducted to determine what performance differences could be
expected in terms of programer's performance at the console as a
function of these code variations.


METHOD 
Dependent Variables 


The criterion fo 
gramer’s efficiency 
racy of coding 
registers. The speed with am 
form the coding task is critical since the data 
essing system is nonproductive during the det ing 
procedures at the 


consol When the 
i 


cost of such a high speed data | 


hourly renta 
ocessing system is 
mak 


reduction in system downtime 


utilization of the 


considered, any 
for more efficient 
there is more than or 


Eact progran 


own programs 


juently 


0 a system 


The program debugging task 


a primary task of any one programer 


intermittent nature. However, total 


time due to debugging by all programer 
the system could significantly reduce 
tem computing time. The frequency 


by any one 


spe nt 
programer interrogating system for 
program debugging purposes will vary as a function 
of the number and complexity of new programs he 
ability 

short and 


written programs contain 


is required to write, his 
work methods 


grams, 


Barring very simple pro 


newly which 


errors 


require detection and correction 


Experimental Conditions


Each of the three possible code schemas facilitated the coding of 48
characters. A character could be displayed on the computer console by
the illumination of one or more of a configuration of neon bulbs in the
display register. For the experimental conditions the possible
configurations of illuminated neon bulbs were simulated with paper and
pencil coding tasks. The configuration of lights for each code structure
was simulated by printed circles on the test sheets. A description of
each code condition is as follows:

Code Condition 1 is an 11-bit (11 lights) Biquinary Code system of which
3 of the 11 lights are impulsed to denote a character. The code system
as shown in Table 1 would be the basis for the character register
display; a modification of this code system would be the basis for the
address register display.

Code Condition 2 is a 7-bit Binary Coded Decimal system of which a
variable number of neon bulbs averaging 3.33 per character are impulsed
to denote a character. The code system as shown in Table 2 would be the
basis for the character register display; a modification of this code
system would be the basis for the address register display.

Code Condition 3 is a combination of Codes 1 and 2 such that the
character coding schema for Code 1 would be the basis for the character
register display; the address register coding schema for Code 2 would be
the basis for the address register display (see Tables 1 and 2).
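
As a rough check on the two character register codes, the number of distinct light patterns each could offer can be compared with the 48 characters to be displayed. The Python sketch below is an upper-bound count only and rests on assumptions: it treats any 3 of the 11 Biquinary lights and any non-empty set of the 7 Binary Coded Decimal lights as usable, which ignores whatever structural restrictions the actual code tables (Tables 1 and 2) impose.

```python
from math import comb

CHARACTERS_NEEDED = 48  # 26 letters + 10 digits + 12 special characters

# Code Condition 1: exactly 3 of 11 lights impulsed per character.
biquinary_patterns = comb(11, 3)

# Code Condition 2: a variable number of the 7 lights impulsed
# (at least one light assumed to be on).
bcd_patterns = 2 ** 7 - 1


def has_capacity(patterns, needed=CHARACTERS_NEEDED):
    """True when the pattern count can cover the required character set."""
    return patterns >= needed


print(biquinary_patterns, has_capacity(biquinary_patterns))
print(bcd_patterns, has_capacity(bcd_patterns))
```

Both counts comfortably exceed 48, so under these assumptions neither code is limited by capacity; the comparison of interest is operator speed and accuracy.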


Design


A 3 X 3 experimental design with repeated measurements of independent
groups was utilized. The three code schemas were analyzed under a series
of three separate trials.


Subjects


Fifty-five naive subjects (33 males, 22 females) were employed in the
study. None of the subjects had had previous experience with nondecimal
number systems or codes such as those of the experiment. Subjects were
paid volunteers from a junior technical college (Broome Technical
Community College). All subjects were drawn from a population of
students majoring in Business Technology. This particular population was
selected to increase the homogeneity of subjects on task relevant
variables (primarily clerical skills) and also to have subjects who were
representative of the expected data processing system programers.

Two of the independent groups consisted of 18 subjects each; the other
group, 19 subjects. Scheduling limitations precluded a completely random
assignment of males and females across groups. Consequently, there was a
disproportionate assignment of males and females to each group. A post
facto analysis, however, supported the assumption of no differences for
the coding task. To test for equality in general ability level between
the groups, t tests were conducted between groups based on the
cumulative scholastic average of each subject. The results of the t
tests indicated no differences between groups.


Apparatus


Printed master code sheets which presented all letters of the alphabet,
the numbers 0-9, and 12 special characters were made up for each code
condition. The code sheets were used during training and testing of the
subjects.

Paper and pencil tests were utilized in the experiment. These were
designed to simulate coding problems which would occur on the data
processing system console registers (see Figures 1 and 2 for examples).
Three test booklets were used under each experimental code condition.
Each test booklet consisted of 36 problems. Each problem is defined as
consisting of a character register and an address register.

In one-half the problems the subjects were required to encode by placing
an X in the appropriate circles. In the remaining one-half of the
problems the test items were presented with Xs already in the printed
circles and the subjects were required to decode by placing the
appropriate character on the answer line below the register. To control
the level of difficulty of the test problems, all groups were given
identical test problems which were structured in accordance with their
assigned experimental code. Selection of the test problems to be used,
and the determination of whether they were to be encoded or decoded, was
arrived at by random assignment.

Fig. 1. Examples of simulated display registers for Biquinary Code
(Character Registers, Address Registers).

Fig. 2. Examples of simulated display registers for Binary Coded Decimal
Code (Address Registers).

Procedure


Each group was required to participate in two phases of the experiment.
The two phases were administered on two consecutive days. Phase 1
consisted of a one-hour group training session. Phase 2, administered on
the following day, consisted of a group testing session also
approximately one hour long.

Task. Subjects were required to code character registers and address
registers in the manner in which such coding occurs in actual field use.
Programers in the field perform address register coding without the aid
of a master code sheet since the task is compatible with established
habits of handling decimal system numeric information. Character
register coding however is frequently conducted by the use of master
code sheets such as used in this study. The code sheet is necessary
because many programers do not perform coding tasks with sufficient
frequency to be able to recall character code configurations.

The format for the master code sheet for each code condition was
identical. Letters of the alphabet were presented in alphabetic order in
the first three rows, and the digits 0 to 9 were presented in numeric
order in the fourth row. This format facilitated a systematic
organization of the code sheet for both encoding and decoding tasks,
regardless of the code schema. There was no apparent variation of this
format which would improve the compatibility between the experimental
task and the given master code sheet.

To perform encoding under any of the three conditions, the subject would
locate the letter, digit, or symbol on the code sheet and make note of
the circles containing an X. To decode under any of the three
conditions, the subject systematically located the character by first
determining which of the four rows to search in and then locating the
character.

Training. Master code sheets were distributed to the members of each
group during the training session. The code sheet corresponded to the
unique code condition assigned to the group. The organization of the
code sheet layout was explained to optimize search behavior habits. Test
sheets identical to those to be used in the testing phase of the study
were distributed to the subjects. An experimenter demonstrated to the
subjects 10 representative decoding and 10 representative encoding
problems. The demonstration was followed by practice trials by the
subjects. For each experimental condition, the same encoding and
decoding problems were drawn from each of the four rows of the master
code sheet and presented to the subjects as examples. One problem at a
time was presented to each group and then checked for accuracy by both
the subjects and two monitoring experimenters. Before continuing to the
next problem, each subject was required to correct any errors. If the
subject could not demonstrate to the experimenter that he could perform
the correction unaided the experimenter would again demonstrate the
general coding procedure to him.

The criterion of success to be considered trained was that of accurate
unaided encoding and decoding of 24 consecutive coding problems. These
problems were representative of all the types of problems to be
presented in the test session. Because of the simplicity and highly
redundant nature of the task, no subject under any condition required
aid following the first five practice problems for any of the problem
types (i.e., decoding a character or address register, encoding a
character register or address register). The need for aid was no more
frequent under one condition than any other condition.

Upon completing the group training session the subjects were instructed
that immediately prior to the testing session, a demonstration of coding
would be repeated. It was assumed that subjects would not rehearse the
task in different amounts between the time of training and the time of
testing for the following reasons: the task did not require memorization
of stimulus code configuration, and therefore there was no need for
rehearsal; because of the simplicity of the tasks the subjects reached
complete mastery of the task early during the training session.

Testing. As indicated above a brief review of coding was presented to
the subjects just prior to testing. Testing consisted of three trials
with one booklet per trial. Three sets of test booklets (1, 2, and 3)
were used, each set consisting of different problems. To make a valid
statistical test of the learning effect across trials, each trial
required a random assignment of the test booklets to the subjects within
a given code condition, with the restriction that each subject perform
on a different booklet on each trial. Subjects were instructed to
complete all problems in a test booklet as accurately and rapidly as
possible, and to signal their completion by a raised hand. The time
required by each subject was recorded by the experimenters.
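
The booklet assignment described above (random order, with no subject repeating a booklet) can be sketched as drawing a random permutation of the three booklets for each subject. This is one way to meet the stated restriction, not necessarily the procedure the experimenters followed; the function name, the seed, and the group size of 18 used in the example call are illustrative.

```python
import random
from itertools import permutations

BOOKLETS = (1, 2, 3)


def assign_booklets(subject_ids, rng):
    """Give each subject a random booklet order across the three trials.

    Every order is a permutation of the three booklets, so no subject
    meets the same booklet twice.
    """
    orders = list(permutations(BOOKLETS))
    return {subject: rng.choice(orders) for subject in subject_ids}


# Example: one code condition with 18 subjects (seed chosen arbitrarily).
schedule = assign_booklets(range(1, 19), random.Random(0))
for subject, order in sorted(schedule.items())[:3]:
    print(subject, order)
```

Because trial position and booklet identity are not tied together under this scheme, a drop in time across trials can be attributed to practice and not to one booklet being easier than another.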








RESULTS


A 3 X 3 analysis of variance for repeated measurements on independent
groups was performed for both the time and error criteria (Edwards,
1957).

Time Criterion Analysis

The 3 X 3 analysis of variance showed over-all significant differences
among code conditions and among trials (p < .001). A test of interaction
between trials and code conditions was insignificant.

The results of the time criterion are presented graphically (see Figure
3). The results of the analysis of variance are summarized in Table 3.

To detect where the differences in mean time occurred between paired
conditions, multiple t tests were computed by the Scheffé (1953) method.
It was found that the time required to perform the tasks using Code 2
was significantly less than the time required using either Code 1 or
Code 3 (p < .001 and p < .01, respectively). The time required for Code
3 was less than that of Code 1, but the difference was not statistically
significant. Similar Scheffé tests were computed for the mean time
required under each trial where the data were pooled for the three code
conditions. The results showed that there was a significant reduction of
the average time required with every successive trial (p < .01).
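
For readers unfamiliar with the Scheffé procedure, the following Python sketch shows the pairwise form of the test on a simple one-way layout. It is a simplified illustration and not the authors' analysis: it ignores the repeated-measures structure of the actual design, the timing values are invented, and the function name is hypothetical. Each pairwise F is divided by (k - 1), so that it can be compared with the tabled F for (k - 1) and (N - k) degrees of freedom.

```python
from statistics import mean


def scheffe_pairwise(groups):
    """Return {(i, j): F} for every pair of groups, scaled for Scheffé's test."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    means = [mean(g) for g in groups]
    # Pooled within-group mean square (one-way layout).
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    ms_within = ss_within / (n_total - k)
    results = {}
    for i in range(k):
        for j in range(i + 1, k):
            diff = means[i] - means[j]
            se_sq = ms_within * (1 / len(groups[i]) + 1 / len(groups[j]))
            results[(i, j)] = (diff ** 2 / se_sq) / (k - 1)
    return results


# Invented completion times for three code conditions.
times = [[10, 12, 11], [7, 8, 9], [9, 11, 10]]
for pair, f_value in scheffe_pairwise(times).items():
    print(pair, round(f_value, 2))
```

The division by (k - 1) is what makes the procedure conservative: it protects the error rate over all possible contrasts among the means, not only the pairs actually examined.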


Analysis of Error Criterion


The 3 X 3 analysis of variance showed no over-all significant difference
in the number of errors among the code conditions. However, there was a
significant over-all difference in mean number of errors among trials
(p < .01). The test of interaction between code conditions and trials
was not statistically significant.



To detect where the differences in the mean 
number of errors occurred between paired 
trials, Scheffé sequential range tests were em- 
ployed. It was found that Trials 2 and 3 had 
significantly fewer errors than the first trial 
(p < .01 and p < .05, respectively). The dif- 
ference between Trials 2 and 3 was not sta- 
tistically significant. 

Additional analyses of the error criterion were performed. With the data
of the three code conditions pooled, a t test for matched groups showed
that there were significantly fewer errors made in the address register
than in the character register (p < .01). It was also found that there
were significantly fewer errors made in the encoding process than in the
decoding process (p : ii}).

No one condition had significantly more errors than another condition in
either the character or address registers. In addition, no one condition
had significantly more encoding or decoding errors than another
condition.

The level of difficulty of the three sets of tests employed in the study
was statistically checked. Neither the time nor error criterion revealed
significant differences between the three tests.



CONCLUSIONS AND DISCUSSION 


There were two major issues with which 
this study was concerned: (a) Which code 
system will permit the operator to utilize his 
code display registers with the most rapid 
rate of speed and result in the least number 
of errors? (6) With which code system can 
the operator improve his performance most 
rapidly? 

With reference to a, the results indicate that the Binary Coded Decimal
system (Code 2) permits coding performance to be carried out at a rate
of speed which is more rapid than either of the other two codes. It
requires 25% less time when compared to the Biquinary Code and 20% less
time when compared to the Combined Code.
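
The two percentages also imply how the Biquinary and Combined codes compare with each other. The Python sketch below works this out under one assumption: that "25% less time" and "20% less time" mean the Binary Coded Decimal time is 75% and 80%, respectively, of the other code's time. The names are illustrative and the Binary Coded Decimal time is set to an arbitrary unit.

```python
# Savings reported for Binary Coded Decimal (Code 2).
SAVING_VS_BIQUINARY = 0.25
SAVING_VS_COMBINED = 0.20


def implied_times(bcd_time=1.0):
    """Times for the other two codes implied by the reported savings."""
    biquinary = bcd_time / (1 - SAVING_VS_BIQUINARY)
    combined = bcd_time / (1 - SAVING_VS_COMBINED)
    return biquinary, combined


biquinary, combined = implied_times()
combined_saving_vs_biquinary = 1 - combined / biquinary

print(round(biquinary, 3), round(combined, 3))
print(round(100 * combined_saving_vs_biquinary, 2))
```

On these figures the Combined Code would take only about 6% less time than the Biquinary Code, a small gap that fits the finding that the difference between Codes 3 and 1 was not statistically significant.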


An inspection of time criterion data sug- 
gests that the greater proportion of differences 
in average time required between conditions 
was a function of character register coding. 
The rationale of this conclusion is that there 
was a significant difference between Code Con- 
dition 2 and Code Condition 3 although both 
code systems had the Binary Coded Decimal 
system used in the address register. It is hy- 
pothesized that the Biquinary Code systems 
require greater coding time as a function of 
the additional operator decision time neces- 
sary to determine whether a glowing numeric 
bit is to be read on the left or right side of 
that bit. 

It is believed that the time advantage of the Binary Coded Decimal would
be a permanent one. The rationale for this assumption is as follows: The
coding task as performed at the data processing console by any one
programer assigned to the system is not a high frequency task as
compared to his major task of developing and writing new programs.
Consequently, a great many programers do not practice sufficiently with
the codes to memorize individual character code configurations. Coding
performance in the field therefore is usually conducted with the aid of
a master code sheet, as simulated in the experiment. Because of
forgetting, it is hypothesized that in the field the level of coding
performance and the performance profile from one console coding session
to the next would be very similar to the performance profiles of this
experiment.

The analysis of the error criterion showed no significant statistical
differences between the code conditions. This finding suggests that the
rate of error would not be expected to differ for the three code
conditions in field operations.

With reference to b, concerning the rate of performance improvement, the
significant reduction in time required and in number of errors in
successive trials indicates that improved performance did take place.
Under no code condition was learning rate more rapid than under another
condition. This conclusion is supported by the statistical finding that
interaction between trials and code systems was insignificant.
Therefore, the slopes of the plotted learning curves under each
experimental code were near parallel for all practical purposes.

The results of this experiment have practical implications for selecting
a code for a data processing system console. As stated earlier, the
Binary Coded Decimal schema was the most efficient in terms of cost and
electronic simplicity. The savings in operator coding time at the
console, which would ensue from the use of this code, further justify
its use.

During program checking and debugging procedures at the console, the
data processing system is nonproductive. Any reduction in system
downtime constitutes an increase in system efficiency. There are many
installations where new and complex programs must frequently be
developed and checked for accuracy. It is in such installations that a
20-25% reduction in debugging time at the console can result in a
practical increase in system productivity.


reduction in debugging time at the con- 


SUMMARY 


Three binary code schemas were evaluated experimentally in terms of operator coding performance. One of the three code schemas was to be selected for use on a data processing system console display. The criterion for the code evaluation was the operator efficiency measured in terms of speed and accuracy of coding console problems. An independent group of subjects was assigned to each of the three code conditions. One of the three code schemas facilitated a significant time savings of 20-25% as compared to the remaining two code schemas. There were no differences in error rate between the three code conditions.







Journal of Applied Psychology
1961, Vol. 45, No. 6, 388


SUPERVISORY PROCEDURES AND WORK-TEAM PRODUCTIVITY ¹


J. S. KIDD AND R. T. CHRISTY

Ohio State University


In spite of the intensive concern of social psychologists with the general phenomenon of leadership, only fragmentary data are available concerning the interrelations between the role specifications of the leader, the individual characteristics of the leader, the operational or task setting, and team productivity. Ever since the Hawthorne studies (Roethlisberger & Dickson, 1939) it has been [...] that the supervisor of a systemized task ori[...] influence [...] and his productive output. Lewin was able to demonstrate that the way in which leadership responsibilities were carried out was an important factor in determining many aspects of group behavior (Lewin, Lippitt, & White, 1939). Again, in the context of corporate organizations Shartle (1956) has pointed out that the personal characteristics of the executive are likely to predominate over the formal role requirements in determining the manner in which the leadership functions are fulfilled.

The present experiment was in no way intended to bring closure to the complex questions surrounding leadership role and team productivity. The limited purposes were to provide some insight into the relative influence of individuals vs. roles, to test the hypothesis that differential (supervisory) role specifications can have an effect upon team performance, to compare the effect of the different supervisory roles on more than one class of team performance criteria, and to develop some guidelines to facilitate the management of multiple team activities, especially those in which the team is to function in a man-machine system.

¹ This research was [...] Aviation Psychology Laboratory [...] supported in part by the United States [...] under Contract No. [...] 616 [...] 6166, monitored by the Aerospace Medical Division. [...] translation, publication, use, [...] in part by or for the United States [...].


METHOD 


Apparatus and Task Setting

The general task environment was provided by the simulation within the laboratory of a radar air traffic control center. The focus of observation was a three-man team consisting of two operators and a supervisor. This team was made responsible for guidance of simulated aircraft through the preliminary phases of a landing approach. The task involved pickup and acknowledgment of aircraft entering a specified [...], guidance of their flight course, altitude, and airspeed [...] prerequisite to actual [...] approach route, [...]ing, and positioning the aircraft for acceptance by [...] control agency for the subsequent final phase of the landing process. A[...] inputs were simulated.

The simulation was implemented by the specially developed OSU Electronic Air Traffic Control Simulator. This device, which is built around an analog [...], is capable of dynamically generating up to [...] targets and presenting them realistically to the radar controller via cathode ray tube display. Direct manipulation of the "aircraft" is accomplished by college students trained to faithfully act the part of pilots. In addition to the visual display of aircraft position available to the controller, he is in direct auditory communication with the "pilots" under [...] jurisdiction through simulated radio channels.


Experimental Variables and Performance Measures


compared in 
traffic input 


» defined 


going 
ultation and wa 


estion 


er He wot 


I to anticipate 


Wh either 


ie Operation in 


1 
ouer er! 


t 
difficulti and detect 


the 


cont! 


idition ar supervisor was ve directive 
volved s s to permit 


instructions to the controller ir 

the situation 

Direct participant. This role differed from the preceding primarily in that the supervisor was directed to take remedial action by direct contact with the pilots. Thus, instead of requesting the controller [...] to correct [...], he could take the corrective action at his own discretion through a direct communication


system efhci 

1ircraft are 

appropriate as follow 

as automat in the “chan 
yy . w Act 

nel” used by the super as that ~ 

used by the controller, 

tween supervisor and th 


troller’s jurisdiction was 


contact with the 
pilots. Coordination w l 


communication b 
i given con 
! i cont I A second me: 
These three conditio1 ve regarded as ¢ present st 
continuum of st I r 


taken 





[Figure 1: mean percent delay (vertical axis) plotted against supervisory role (laissez-faire, active monitor, direct participant), with separate curves for low and high input load.]

Fig. 1. Mean percent flight delay under various conditions of supervisory role and input load.


Subjects and Procedure

Specially trained college students participated both as controllers and as supervisors. Six two-man control teams were formed by random pairing from a 12-man sample. The three supervisors each had more than 2.5 years experience in the laboratory as controllers and had filled various leadership positions during this time. The test trials were arranged so that each supervisor had one opportunity to work with each of the teams. Thus, no control team experienced a supervisor in more than one role, although, in going from team to team, the supervisor changed roles. A modified Greco-Latin square design was used so as to balance load, role, and supervisor factors with practice effects. The design is detailed in Table 1.

A given experimental session represented a particular combination of one control team, a supervisor, and a role. Each session was 3.5 hours in duration and incorporated both input load conditions. Eighteen such sessions were required to complete the design requirements.
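
The counting behind the eighteen sessions can be illustrated with a short sketch. It is not the authors' Table 1: the cyclic assignment rule, the supervisor labels, and the function name are assumptions chosen only to show one arrangement in which every team meets every supervisor once, each time in a different role. Session order (practice effects) and input load are not modeled.

```python
from collections import Counter

ROLES = ["laissez-faire", "active monitor", "direct participant"]
SUPERVISORS = ["A", "B", "C"]   # hypothetical labels
TEAMS = range(1, 7)             # six two-man control teams


def build_sessions():
    """Return (team, supervisor, role) triples using a cyclic rule.

    Assumed rule: role index = (team + supervisor index) mod 3.
    """
    sessions = []
    for team in TEAMS:
        for s_idx, supervisor in enumerate(SUPERVISORS):
            role = ROLES[(team + s_idx) % len(ROLES)]
            sessions.append((team, supervisor, role))
    return sessions


sessions = build_sessions()

# How often each supervisor appears in each role across the six teams.
role_counts = Counter((sup, role) for _, sup, role in sessions)

print(len(sessions))
for (sup, role), n in sorted(role_counts.items()):
    print(sup, role, n)
```

Under this rule each supervisor fills each role with exactly two teams, which is the kind of balance the text describes; the published design may have achieved it with a different arrangement.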


RESULTS 


Three factors are of analytic concern: the 
influence of the supervisors as individuals on 
system performance, the influence of super- 
visory roles, and the influence of input load. 
All three factors and their interactions were 
compared in terms of the mean percent delay 
criteria. The remaining criteria were employed 
only with respect to the comparison of the 
effects of supervisor roles. 

The first step in analysis was an over-all 
comparison by analysis of variance of the 
three factors using the delay criteria. This 
was done primarily to assess the compara- 
tive variance contributed by supervisors as 
individuals vs. the variance contributed by 


the role assignments and as a check on inter- 
actions rather than as a direct comparison of 
the conditions within a given factor. The in- 
fluence of supervisors as individuals yielded a 
mean square variance 3.7 times greater than 
that derived from the role factor. The inter- 
actions of role and supervisor and of role and 
load were statistically negligible. The influ- 
ence of these factors may be seen directly by 
reference to Figures 1 and 2. The relation- 
ship between role and supervisor is of po- 
tential interest since the best of the three su- 
pervisors is also the least variable insofar as 
this can be assessed by the performance of 
his crews. 

Since the effect of interaction between the 
experimental variables was relatively minor, 
direct comparison of conditions within fac- 
tors was undertaken by nonparametric meth- 
ods. Table 2 presents the comparison of the 
three supervisory roles in terms of all four 
criteria of system performance. In all cases 
the analysis is carried out between pairs of 
conditions, since they represent discrete cate- 
gories rather than a continuous dimension. It 
will be noted that the laissez-faire role is su- 
perior according to the delay criterion but 
that active monitoring is superior with respect 
to detection of programed pilot errors and 
with respect to avoidance of positioning errors 
(in the final stages of a flight), while the di- 
rect participant role is superior in the avoid- 
ance of separation errors. 

The input load factor evaluated alone was highly significant. Under low load conditions, the average percent delay was 97.8, while under high load conditions it was 117.6. By the Walsh test, such a difference has a probability of chance occurrence of .031.

[Figure 2: mean percent delay (vertical axis, ticks at 100, 120, and 140) plotted against supervisory role (laissez-faire, active monitor, direct participant), with separate curves for Supervisors A, B, and C.]

Fig. 2. Mean percent flight delay with different individual supervisors taking various supervisory roles.

TABLE 2
THE INFLUENCE OF SUPERVISORY ROLE ON VARIOUS MEASURES

[Table body not recovered.]


DISCUSSION

The relatively small sample size employed in the present experiment limits, to some degree, the extent to which generalizations can be made. The major virtue of the experiment is perhaps in the fact that problems usually susceptible only to field investigation were brought into the controlled laboratory environment.

Thus some quantitative validity can be given to the proposition that individual supervisor characteristics are more influential on work-team performance than are the various standard styles of carrying out the supervisory function. The problem remains, however, as to just exactly what are such personal characteristics. Although there are supervisor selection techniques available, further vigorous study of the problem should be given added incentive by the present findings.

The comparison of the supervisory roles, per se, leads to the proposition that there is a distinct trade-off effect between [...] criteria. Thus, the nonintervention feature [...] laissez-faire role [...] allows the controller to concentrate on maintaining a rapid [...] through the system. The controller apparently accomplishes this particular end [...] recklessness since [...] relatively high. When the supervisor [...] his role more aggressively, errors a[...] but at a cost in the rapidity of [...]. The management implication seems clear: [...] recommended supervisory procedure depends on [...] importance of speed vs. accuracy i[...] at hand. Apparently, high levels of [...] demand more overt intervention by the supervisor. When both are equally important, a moderate level of intervention would seem most appropriate.

This conclusion should also be tempered by the recognition that novice controllers will require more active participation on the part of their supervisors so that minimal performance levels are consistently assured. Thus, adoption of the laissez-faire approach would depend on both the nature of the task and the level of achieved skill on the part of the operators.







In the present study, the supervised con- 
trollers were informally interviewed subse- 
quent to their participation in the experiment. 
Their comments were fairly consistent: they 
preferred the laissez-faire condition. Their re- 
sponse to the other conditions changed with 
time. At first, they were somewhat resentful of the supervisor's "interference," but as time passed they were more inclined to perceive the supervisor as at least well intentioned. Therefore, an effective supervisor might be one who promptly made his intentions manifest to his team.


SUMMARY 


A complex task setting provided by the simulation of a radar air traffic control system was the context employed to evaluate the extent to which work-team productivity is modifiable as a consequence of different supervisors and supervisory procedures under different task loads. Six two-man teams were observed. Each team worked under each of three supervisors. The supervisors shifted from team to team. Three techniques were adopted alternately: laissez-faire, active monitoring, and direct participation.

It was observed that the individual supervisor was a more consistent influence on performance than the particular role he employed. The effect of role per se was significant but interpretable only in light of the particular performance criterion used. For example, processing speed was greatest under laissez-faire conditions, while error avoidance was superior under the conditions which required more overt supervisory activity. Task load was not found to be an interactive factor.


REFERENCES

Lewin, K., Lippitt, R., & White, R. K. Patterns of aggressive behavior in experimentally created "social climates." J. soc. Psychol., 1939, 10, 271-299.
Roethlisberger, F. J., & Dickson, W. J. Management and the worker. Cambridge: Harvard Univer. Press, 1939.
Shartle, C. L. Executive performance and leadership. Englewood Cliffs, N. J.: Prentice-Hall, 1956.
Siegel, S. Nonparametric statistics for the behavioral sciences. New York: McGraw-Hill, 1956.

(Received November 25, 19[...])


Journal of Applied Psychology
1961, Vol. 45, No. 6, 393-401


BALES' INTERACTION PROCESS ANALYSIS OF PERSONNEL SELECTION INTERVIEWS ¹


DANIEL SYDIAHA


The purpose of this investigation was to examine whether Bales' (1950) categories of interaction process analysis were related to decisions made by personnel selection interviewers.

It has been generally [...] that decisions made by interviewers are notoriously unreliable, in the sense that assessments made by two or more interviewers of the same applicants tend to differ. The question as to the sources [...] has not, however, been [...] literature, except for [...] of various situatio[...] systematic questioning [...]. In the absence of empirically supported generalizations about sources of error, the practising interviewer is left with little more than a feeling of disdain toward a technique which is generally believed to be inadequate.

The hypothesis advanced in the work reported here is that decisions made by interviewers in personnel selection are functionally related to measurable dimensions of interview process and content. The point of view adopted is essentially Brunswik's notion of probabilistic functionalism (Brunswik, 1956). While this is admittedly a simple hypothesis, it is nonetheless not a naive one, since the more common assumption about interviewing is that interview process and content are unsystematic and chaotic. The approach taken has been frankly empirical, implying no strong predisposition as to the nature of interview process, but rather an attempt is made at developing three variables which have some plausibility [...] functionally [...]. [...] variables are: (a) informatio[...] in a previous paper (Sydiaha, [...]); (b) Bales' interaction process analysis, reported in [...] here; and (c) interviewer [...].

¹ Financed by [...] Research Board [...] Grant No. 9435 [...]. Webster directed the research and John Kenyon assisted with [...]. The author gratefully acknowledges [...] interviewers [...] Personnel Selection Service.


interview 
tion proc aSS al 


iT 
ik 


ig method because O 


1955; Maccoby New 

Swanson, Newcomb, & H 
uniformities i 
to 


nature 


irlle 
i? ; y y rn ‘ 
In inter: n pat 


son useful 


ic 
of 
differentiations have 

rect relevance to decision making 


h as the difi 


y 


oad +} ] 
social-emotional 


I he 


elsew he re 


interaction 
( Bales 
tinct 


“ae 
ire classified ir 


ion is n 
task” 

“social 

cee Bs 

is i 


noted that a dis 


which 


») and 


nd ] 


+, al 


6 J, and 
gories (1, 2 


re 


two broad divisions the 


of task acts into questions 


’) and attempted answers ( 


6. gives information, 
for opinion, 9 


} 


10WS tension, 1 


categories used in 
formation so as to re 


6A “ ’ 





Social-emotional acts divide into positive (1, 2, 3) and negative (10, 11, 12) acts. In addition to these 12 categories proposed by Bales, two other categories were added for this study, namely, Categories 6A and 7A, these being subcategories of 6 and 7. Category 7A acts were those usually scored in Category 7 ("asks for information, orientation, etc.") but which are framed in such a way as to require a simple "yes" or "no" response. Similarly Category 6A acts were those usually scored in Category 6 ("gives information, orientation, etc.") but they are expressed as simply "yes" or "no." The purpose of adding these categories was to test the consequences of what might be termed "directive" questioning in which the applicant has little or no control over the information communicated but can only reply "yes" or "no" to questions put to him. This pattern of questioning is in contrast to a more nondirective situation in which questions are open ended, the applicant is given considerable freedom of choice in his responses and, in a sense, is able to exert some control over the interview. It seemed apparent, from informal observations made during preliminary stages of the study, that directive questioning tended to have one of two consequences: either an atmosphere was created of the interviewer cross-examining the applicant, or it signified that the interviewer had lost control of the situation in that he had exhausted his supply of questions and was frantically searching for things to talk about. In either case, it seemed apparent that the situation was an uncomfortable one for both applicant and interviewer, and tended to be associated with rejection of the application.


The following hypotheses were proposed, 
consistent with the above discussion: 


H1: Comparing interviews in which applicants are accepted with those in which applicants are rejected: (a) interaction scores are higher for Categories 1, 2, 3, 4, 5, 6, and (b) lower for Categories 7, 8, 9, 10, 11, 12, 6A, and 7A.

H2: (Consistent with the notion of interinterviewer error) Differences in interaction scores between accept and reject cases are greater for some interviewers than for others.


Two studies were executed, designated Studies I and II.

STUDY I

Procedure

Training in the use of Bales' interaction process analysis. The method of scoring described by Bales (1950) was adhered to as closely as possible. Sound recordings of interviews were obtained by the experimenter (the writer) and scoring of these interviews was repeated until two consecutive scorings of the same interviews gave comparable results. Comparability was tested by means of the chi square test as described in Bales (1950). To check on comparability of results, another scorer (A. Clark) received some training in the scoring method, and marked discrepancies between the two scorers were discussed, and criteria for agreement decided upon, in light of Bales' instructions. As an added check on scoring, the experimenter visited Bales at Harvard and discussed scoring problems with him.³

Training was carried out with a sample of 25 interviews. Upon completion of training, a sample of 67 interviews was scored twice to test the reproducibility of scoring. Time intervals between consecutive scorings varied from 6 hours to 3 weeks. For each scoring category, a correlation (Pearson product-moment) was calculated between the two sets of scores. The results are shown in Table 1. The reproducibility of scores shown in Table 1 is high although the correlations are undoubtedly elevated by the recollection of the first scoring on the second occasion.
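
As a rough illustration of the reproducibility check, the sketch below computes a Pearson product-moment correlation between two scorings of one category. The function name and the five pairs of counts are hypothetical; the study's own values (from 67 interviews) were reported in its Table 1 and are not reproduced here.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return sxy / sqrt(sxx * syy)


# Hypothetical counts of acts in one category for five interviews,
# as tallied on the first and the second scoring occasion.
first_scoring = [12, 30, 7, 22, 15]
second_scoring = [14, 28, 9, 21, 15]

r = pearson_r(first_scoring, second_scoring)
print(round(r, 3))
```

In the study this calculation would be repeated once per scoring category, giving one coefficient per category.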
Subjects. One hundred fourteen interviews were obtained from nine interviewers, with a range of 9-24 per interviewer. All applicants and interviewers were male. The interviewers included six Canadian Army personnel officers, one Royal Canadian Air Force career counselor, one industrial employment interviewer, and a psychologist engaged in private consulting practice. Five of the Army officers examined recruit applicants and the sixth examined officer applicants. The Air Force interviewer examined applicants for both air crew and administrative positions. The industrial interviewer examined principally junior office and clerical applicants. The subjects obtained from the psychologist varied widely in terms of qualifications, from industrial apprentices to senior executives, and for purposes of this study, the sample was divided into "Junior" and "Senior" groups. The number of interviews in the groups varied from 9 to 15. The selection of [...] of interviewers and applicants is in no sense to be considered representative of any population of selection interviews regularly conducted, but rather merely a sample of accessible interviews obtained from interviewers who could be persuaded to contribute data for the study.

Conditions of interviews were "natural" except [...] most cases (approximately 80%) [...] present in the interview room [...]; except for five interviews, all interviews [...] recorded. Concealment of the sound recorder [...] left [...] the discretion of the interviewer, and in [...] the cases the applicant did not know [...] a recorder in the room.

³ Bales' assistance and interest in this study are gratefully acknowledged.

TABLE 1
CORRELATIONS (Pearson Product-Moment) BETWEEN TWO REPEATED SCORINGS OF RECORDED INTERVIEWS

[Table body not recovered.]

Upon completion of each interview, the interviewer recorded his recommendation as to whether the applicant should be accepted, rejected, or whether the case was marginal or doubtful. Because the interviews were selected more or less at random from the stream of applicants examined, it was not possible to ensure that the numbers of cases would be the same for all three categories or [...] interviewers. In some instances only a sin[...] a particular decision cate[...]. [...]cided [...] to combine the "[...] categories, thereby including al[...] analysis [...] variance could be made. Admittedly, this [...] combining judgment categories produced a questionable division of cases since many of the marginal cases [...] in fact accepted for employment although [...] analysis they were classified as hav[...]. In defence of this procedure, it [...] noted that the criteria for employment were necessarily not uniform because of differing employment settings in which the interviewers worked. Furthermore, Study I was a preliminary one only, and was designed to test whether differ[...] found between any decision categories. [...] II made use of consistent [...] as well as working [...] rejection [...].

Analysis of data. Interaction scores for each interview were transformed into percentage figures in order to equate for interview duration, and these percentage figures for each category were analyzed by complex analysis of variance with double classification. In other words, a separate analysis of variance table was prepared for each of the 13 categories. (Category 9 was excluded from analysis because there were too few acts in that category to warrant analysis.) In each analysis of variance table, rows represented decision categories, columns represented separate interviewer groups, and the interactions represented interviewer-decision interactions. Since all frequencies were [...] necessary to make adjustments by t[...] of expected [...]qual frequencies, as described [...] Ferguson (1959). Other variables analyzed were duration of [...] combined sc[...] 12. Mean squares for row, column, [...] interactions [...] individual interview components a[...] Table 2. F ratios were calculated for al[...]. Following McNemar (1[...]), denominators used for the F ratio were: individual interviews mean square to test for the significance of interactions, and [...] mean square to test for the significance of main effects.
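
The percentage transformation described under "Analysis of data" can be sketched as follows. The counts are invented for illustration, and the helper names are hypothetical. The four combined scores follow the groupings named in the Results (1 + 2 + 3, 4 + 5 + 6, 7 + 8 + 9, 10 + 11 + 12); treating 6A and 7A as separate tallies outside those sums is an assumption, since the text does not say how they entered the combined scores.

```python
# Combined scores named in the Results section.
GROUPS = {
    "positive social-emotional (1+2+3)": ["1", "2", "3"],
    "problem-solving attempts (4+5+6)": ["4", "5", "6"],
    "questions (7+8+9)": ["7", "8", "9"],
    "negative social-emotional (10+11+12)": ["10", "11", "12"],
}


def to_percentages(counts):
    """Convert raw act counts per category into percentages of all acts."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("interview has no scored acts")
    return {cat: 100.0 * n / total for cat, n in counts.items()}


def combined_scores(percentages):
    """Sum category percentages into the four combined scores."""
    return {
        name: sum(percentages.get(cat, 0.0) for cat in cats)
        for name, cats in GROUPS.items()
    }


# Hypothetical tallies for one interview (200 acts in all).
counts = {
    "1": 4, "2": 6, "3": 20, "4": 5, "5": 25, "6": 60, "6A": 10,
    "7": 30, "7A": 20, "8": 10, "9": 0, "10": 4, "11": 5, "12": 1,
}

percentages = to_percentages(counts)
combined = combined_scores(percentages)
for name, value in combined.items():
    print(name, value)
```

Because every score is expressed as a share of the acts in its own interview, a long interview and a short one with the same pattern of acts receive the same scores, which is the stated purpose of the transformation.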


Results 


Decision category differences. Significant differences between decision categories were obtained for Categories 1 (shows support), 3 (agrees), 1 + 2 + 3 (positive social emotional acts), and 10 (disagrees). Mean category scores, not presented in this paper, were consistent with expectation: "accept" interviews were characterized by high scores in positive social emotional categories and by low scores in Category 10.

It should be noted that of these four categories, three also showed significant differences among interviewers (Categories 1, 3, and 1 + 2 + 3). For these categories, then, mean scores are not comparable for all interviewers.

Interviewer-decision category interactions. Significant interactions were obtained for Categories 4 (gives direction), 8 (asks for opinion), 12 (shows antagonism), 4 + 5 + 6 (problem solving attempts), and 10 + 11 + 12 (negative social emotional acts). Mean category scores were again consistent with expectation, although there were some reversals. In general, "accept" interviews tended to be characterized by high scores in Categories 4 and 4 + 5 + 6, and by low scores in Categories 8, 12, and 10 + 11 + 12.

These results imply that decision making is 
related to frequency of occurrence of acts in 
these categories, but not for all interviewers. 
These categories suggest sources of interview 
process which account for interinterview error. 

Of the five categories, two also showed sig- 
nificant differences among interviewers (Cate- 
gories 8 and 4+ 5+ 6). This implies non- 
comparability of scores among interviewers, 
although these findings have no special sig- 
nificance since, in any case, all significant in- 
teractions imply noncomparability of scores. 

Interviewer differences. Significant differences between interviewers were obtained for Categories 2 (shows satisfaction), 6A (gives information in "yes" or "no" terms), 7 (asks for information), 7A (asks for information with questions to be answered as "yes" or "no"), 11 (shows tension), duration of interview, and 7 + 8 + 9 (questions). These results are of no special significance, however, since they are not related to the decision categories.


Discussion 


Hypotheses 1 and 2 were confirmed: some 
of the Bales scoring categories were correlated 
with decisions made by personnel selection 


interviewers. In general favorable decisions 
were characterized by high scores in problem 
solving attempts and positive social emotional 
acts, and by low scores in questions and nega- 
tive social-emotional acts. However, the dif- 
ferences between decisions were not uniform 
among interviewers. 

In general, the results of this preliminary 
study showed promise of depicting something 
about the nature of interview process, with regard to uniformities both specific and common to interviewers.

Demonstrating the presence of a relation- 
ship between categories and decisions leaves 
unanswered the question of the magnitude of 
the relationship. Because the interview groups 
in Study I were too small to permit the calcu- 
lation of correlation coefficients between vari- 
ables, Study II was conducted with larger 
numbers of interviews per interviewer to per- 
mit such estimates of degree of relationship. 
Furthermore, all cases were obtained from
one organization (the Canadian Army) in or- 
der to standardize the criteria of acceptance- 
rejection used. 

Study II was also set up to examine the importance of interviewer and applicant scores
considered separately. Up to this point, each 
category score had included the combined acts 
of both applicant and interviewer and the re- 
sults considered in terms of the relevance of 
interaction scores to decision making, regard- 
less of which participant contributed acts to 
the conversation. Separate participant scores, 
on the other hand, are related to questions of 
individual role performance as it relates to 
decision making in the interview. 

Some evidence in support of distinctive ap- 
plicant and interviewer profiles was obtained 
from the data in Study I, but it was not in- 
cluded in Table 2 because many of the cate- 
gory scores were found to be heterogeneous, 
thus rendering use of the F ratio somewhat 
tenuous. Analysis of the data using Fried- 
man’s nonparametric analysis of variance 
(see Ferguson, 1959) yielded differences be- 
tween decision categories for some scores, but 
since the statistical procedure does not assess 
interaction effects, the data were not consid- 
ered comparable to those shown in Table 2, 
and are omitted from this paper for that 
reason. 


STUDY II

Procedure

Details of the study are described elsewhere (Sydiaha, 1959). Each of eight Canadian Army Regular Force personnel officers interviewed from 13 to 50 Regular Force applicants for the Canadian Army. Total N for the project was 256, but Bales' data were missing in 6 cases. Interviews were conducted under usual Army circumstances except for additional research material requested for the studies herein reported, and the interviews were sound recorded. The recording machines were in plain view of all applicants, and they were informed that the recording was to be used for research purposes. Only the applicant and the interviewer were present in the room during each interview. Each case was assessed by only one officer, who classified the applicant as either "accepted" or "rejected." Rejected cases included 20 applicants recommended for assessment at a later date and 7 applicants referred for psychiatric assessment. Accepted cases included 29 applicants considered marginally suitable for Army service.

The cases were divided into three separate groups to permit cross-validation of findings. Cases provided by Officers A, B, F, and G (N = 37, 50, 50, 41) were randomly assigned to a criterion group (N = 88) and a holdout group (N = 90). The remaining cases, i.e., those provided by Officers C, D, E, and H (N = 13, 23, 18, 18), formed a second holdout group (N = 72).
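
The group sizes quoted above are internally consistent, as a short arithmetic check shows. The figures are taken from the text; the variable names and the seeded random split are illustrative assumptions, not the procedure actually used to assign cases.

```python
import random

# Cases contributed by each officer, as reported in the text.
pool_officers = {"A": 37, "B": 50, "F": 50, "G": 41}
second_holdout_officers = {"C": 13, "D": 23, "E": 18, "H": 18}

pool_total = sum(pool_officers.values())
second_holdout_total = sum(second_holdout_officers.values())
usable_total = pool_total + second_holdout_total

# Illustrative random split of the pooled cases into the two reported sizes.
case_ids = list(range(pool_total))
random.Random(0).shuffle(case_ids)      # fixed seed, for repeatability only
criterion_group = case_ids[:88]
first_holdout_group = case_ids[88:]

print(pool_total, second_holdout_total, usable_total)
print(len(criterion_group), len(first_holdout_group))
```

The pooled cases number 178 (88 + 90), the second holdout group 72, and the two together 250, which matches the 256 cases in the project less the 6 with missing Bales data.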
Scoring was performed from disc sound recordings by the experimenter (the writer) as in Study I. For each interview, interaction scores were converted to percentages for the applicant and interviewer separately in order to partial out the effects of the total amount spoken by each of the participants. (These were treated as separate scores.) The general statistical procedure followed was to combine all scores mathematically in such a way as to maximize the correlation between combined scores and acceptance-rejection for the criterion group, and the scoring key was then applied to the holdout group to test the relation between scores and acceptance-rejection. Three such combined scores (referred to henceforth as "indices") were developed: the "Applicant Index" (made up of applicant scores), the "Interviewer Index" (made up of interviewer scores), and the "Interview Duration Index" (made up of the total number of acts scored for each of the two participants and the duration of the interview in minutes).

The 


the Interview 


used to 
Duration Index. This 


discriminant function was make up 


procedure was 


not felt to be justified for making up the other tw: 
} } , 


indices, however, becaus« large number of 


variables involved. Guilfor 54), for example 


has pointed out the inadvisabili f multiple r 


gression analysis for combining test scores when thei: 
number is high. Consequently these indices wert 
made up by the standard error 

Gulliksen (1950) 
a function of the standard deviation and 
Variables w hic h 
negatively with acceptance-rejection (for the criterion 
assigned negative weights. After all three 
combined into 


func 


method of weighting, 
described by whereby scores are 
weighted as 
the reliability coefficient correlated 
group) were 
indices had been calculated, they were 
a “Total 


tion 


Index” by means of a discriminant 


As was indicated in the results of Study I above, interaction scores are not uniform for all interviewers. To assess the consequences of such interviewer differences in Study II, two separate criteria of item selection were used, which, in turn, lead


criteria 
oan ple s A 


combined Offi 


‘ s, and two sets mn i two 
acceptance-rejection for individual Officer 
3, F, G; acceptance-rejection for the 
cer Sample A+ B+F+G. The 
in the use of these two criteria was that, on the one 
hand, items correlated 
individua 
pecific t 
acceptance-rejection lor the 
vould reflect 
Depending upon the relative proportion 
cif and 


data, the following results might be 


reasoning involved 


with acceptance-rejection 
officers would reflect crite: 

each officer, whereas items 
combined 


criterion variance common 
common-criterion variance preset 
expect 
li criterion variance 
then correl 


cur: (@) common to 


were zero, indices 
acceptance rejection would combine 
holdout samples (A+B 


H). In 


bined sample correlatior 


other words, 


terion variance commor! 

terion variance specific to officers were z 
relations between 
would be the 
based 


scoring keys. In 


indices and accept 


regardless of whether indice 


Sam< 


upon divid ly base r grou 


were 


I 
t 
i 


' 
ation di 


between indices two 
be indicati 
pecific officers 
Consistent with such 
adopted were as 
Individual scor 
related with acceptance 
riterion 


scoring weights derived 


officer ¢ groups 
applied to eacl 
corresponding officer holdout groups. This 
vas intended to maximize criterior 
to the four officers 
Group scoring keys: All 
standard 


variabl 
score form, based upon 
h officer 
acceptance-re)}« 
namely, A 


obtained for ea 
elated 
combined crite 
The scor 


holdout 
oldout 


scores 


then cor with 


rion group, 
g weights derived were applied to 

groups, namely, A+ B+ F 

H. This procedure 


Variance 


Results 

Group scoring keys. 

Applicant indices: Correlations between applicant indices and acceptance-rejection were .31 and .23 for the two holdout groups (see Column 4, Table 3). These correlations are statistically different from zero (p < .005). (Point-biserial correlations were tested for statistical significance by the method described in Ferguson, 1959.) The difference between these correlations was not significant, thus making it possible to calculate a correlation based on all cases, which was .28. Differences among the seven officer-sample correlations shown in Column 4 of Table 3 were not significant (p > .9). (Correlation differences among officer samples in this paper were tested by the method described in Snedecor, 1956.)

TABLE 3
CORRELATION (POINT-BISERIAL) BETWEEN ACCEPTANCE-REJECTION AND THREE BALES' INDICES, USING INDIVIDUAL SCORING KEYS AND GROUP SCORING KEYS (Study II)
[Columns: Sample; Applicant Index; Interviewer Index; Interview Duration Index; Three indices combined; each index given for individual scoring keys (ISK) and group scoring keys (GSK). Rows: officer samples and all cases. Cell values not recoverable.]
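For readers unfamiliar with the statistic, the point-biserial correlation between an index score and a dichotomous accept/reject decision can be computed as below. This is a minimal sketch of the textbook formula, not a reconstruction of the original computations; the function name and the toy data are hypothetical.

```python
import math


def point_biserial(scores, accepted):
    """Point-biserial r between continuous scores and a 0/1 outcome.

    Uses r = (M1 - M0) / s * sqrt(p * q), where s is the standard
    deviation of all scores (dividing by n) and p, q are the
    proportions of accepted and rejected cases.
    """
    if len(scores) != len(accepted):
        raise ValueError("scores and accepted must have the same length")
    n = len(scores)
    group1 = [s for s, a in zip(scores, accepted) if a == 1]
    group0 = [s for s, a in zip(scores, accepted) if a == 0]
    if not group1 or not group0:
        raise ValueError("both outcome categories must be present")
    mean_all = sum(scores) / n
    sd = math.sqrt(sum((s - mean_all) ** 2 for s in scores) / n)
    if sd == 0:
        raise ValueError("scores have zero variance")
    p = len(group1) / n
    q = 1.0 - p
    mean1 = sum(group1) / len(group1)
    mean0 = sum(group0) / len(group0)
    return (mean1 - mean0) / sd * math.sqrt(p * q)


# Toy data (made up): four cases, two rejected (0) and two accepted (1).
r = point_biserial([1.0, 2.0, 3.0, 4.0], [0, 0, 1, 1])
print(round(r, 3))
```

The value obtained this way is identical to the ordinary product-moment correlation computed with the decision coded 0 and 1.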
Interviewer indices: Correlations between interviewer indices and acceptance-rejection were .34 and .25 for the two holdout groups (see Column 6, Table 3). These correlations are statistically different from zero (p < .005 and .025, respectively). The difference between these correlations was nonsignificant, and the correlation coefficient for all cases combined was .29. Differences among the seven officer-sample correlations shown in Column 6 of Table 3 were not significant (p > .8).

Interview duration indices: Correlations between the interview duration indices and acceptance-rejection were .08 and .19 for the two holdout groups (see Column 8, Table 3). These correlations are not statistically different from zero, as determined by the test of the significance of multiple correlation, described by Ferguson (1959).



Three indices combined: Correlations between the three indices combined and acceptance-rejection were .37 and .32 for the two holdout groups (see Column 10, Table 3). Using the test for the statistical significance of multiple correlation, only the first correlation is different from zero (p < .01). However, the two correlations are not significantly different, and the multiple correlation for all cases combined is .35, which is nonzero (p < .01). Differences among the seven officer-sample correlations shown in Column 10 of Table 3 were not significant (p > .9).

The results tend to support Expectation a: 
With the exception of the interview duration 
indices, all measures were correlated with ac- 
ceptance-rejection. The scoring procedures 
used were successful in eliminating interinter- 
viewer differences in interaction scores, while 
at the same time, demonstrating consistent 
interaction process common to all interviewers. 

Individual scoring keys. 

Applicant indices: The average correlation between applicant indices and acceptance-rejection was .32 compared with .31 for indices based on the group scoring keys (see Columns 3 and 4, Table 3).


Interviewer indices: The average correlation between interviewer indices and acceptance-rejection was .62 compared with .34 for indices based on group scoring keys (see Columns 5 and 6, Table 3). The question as to whether the difference between these correlations is significant could not be answered directly, and a simplifying assumption was made in order to obtain an estimate of statistical significance. The reason that a direct test of significance could not be made in this instance was that a correlation between the two sets of indices was required in order to test the correlation differences since both correlations were based on the same subjects (see Ferguson, 1959). Such a correlation between indices was not feasible in this case since there were wide differences in mean scores among the four officer samples based on the individual scoring keys.


The simplifying assumption made was that 
the correlation between the two indices was 
zero, which represents a conservative esti- 
mate of the true state of affairs. Using this 
assumption, the difference between the two 
correlations (.62 vs. .34) was statistically sig- 
nificant (p < .05). 
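One common way to approximate such a comparison is sketched below. It is not necessarily the computation used in the paper: it treats the two correlations as if they came from independent samples, applies Fisher's z transformation, and assumes n = 90 for both correlations (the size of the first holdout group). The function name and the choice of n are assumptions made only for illustration.

```python
import math


def fisher_z_difference(r1, r2, n1, n2):
    """Normal-approximation test of the difference between two
    correlations, treated as independent (an assumption)."""
    z1 = math.atanh(r1)  # Fisher's z transformation
    z2 = math.atanh(r2)
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    # Two-tailed probability under the standard normal distribution.
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_two_tailed


# .62 and .34 are the correlations reported in the text; n = 90 is assumed.
z, p = fisher_z_difference(0.62, 0.34, 90, 90)
print(round(z, 2), p < 0.05)
```

Under these assumptions the difference falls beyond the .05 level, which agrees with the conclusion reported in the text.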

Considering individual officer samples in 
Columns 5 and 6 in Table 3, correlation dif- 
ferences were not significant in any case, al- 
though the small sample sizes involved would 
militate against demonstrating any significant 
results. 

Interview duration indices: The average correlation between interview duration indices and acceptance-rejection was -.04 compared with .08 for indices based on group scoring keys (see Columns 7 and 8, Table 3).

Three indices combined: The average cor- 
relation between the three indices combined 
and acceptance-rejection was .46 compared 
with .37 for the indices based on the group 
scoring keys (see Columns 9 and 10, Table 3). 

The results obtained from the interviewer 
scores tend to support H,: Indices based on 
individual keys account for more criterion 
variance than do indices based on group keys. 
Applicant and interview duration indices do 
not yield different results when comparing in- 
dividual with group scoring keys. 




DISCUSSION 


Hypothesis 1 was confirmed: Bales’ inter- 
action process analysis scores were correlated 
with decisions made by personnel selection 
interviewers. The correlations obtained were 
modest in magnitude, although they were uni- 
form across all interviewers, provided the data 
were transformed into standard scores. Tak- 
ing the correlation coefficient of .35 based 
upon all cases and combining all categories it 
may be concluded that approximately 12% of 
the criterion variance is accounted for by in- 
teraction score variance common to all inter- 
viewers. 

Confirmation of Hypothesis 2 was confined 
to interviewer conversation only: interviewer 
differences in interaction scores between ac- 
cept and reject cases were found only for in- 
terviewer indices, not for the applicant and 
interview duration indices. Taking .62 as be- 
ing the correlation between interviewer indices 
and acceptance-rejection we may estimate the 
criterion variance thus accounted for to be 
38%, which presumably includes both common and specific variance. By subtracting the estimated common variance (12%), mentioned above, it would appear that criterion variance specific to interviewers is of the order of 26%.
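The arithmetic behind these estimates is simply the squared correlation expressed as a percentage. A short sketch follows; the correlations are those reported in the text, while the function and variable names are hypothetical.

```python
def percent_variance(r):
    """Percentage of criterion variance accounted for by a correlation r."""
    return 100.0 * r * r


# Correlation for all cases, all indices combined (group scoring keys).
common = round(percent_variance(0.35))
# Average correlation for interviewer indices (individual scoring keys).
common_plus_specific = round(percent_variance(0.62))
# Variance specific to interviewers, by subtraction.
specific = common_plus_specific - common

print(common, common_plus_specific, specific)
```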

It should be understood that these variance estimates are, at best, very rough, since the correlations shown in Table 3 are based on very small samples, and there may undoubtedly be differences among interviewers in the magnitude of criterion variance accounted for by specific interaction score variance. However, as a first approximation, it may be concluded from this study that such specific interaction score variance accounts for interinterviewer error to a significant degree.


It should be pointed out that the occurrence of such interviewer differences precludes the making of definitive statements as to "typical" interaction scores in employment interviews. While mean scores in Study II tended to be consistent with the results of Study I ("accept" interviews tended to be characterized by high scores in positive social-emotional, and problem solving categories, and by low scores in questions and negative social-emotional categories), these trends were somewhat incidental. Mean scores varied considerably among decision categories as well as among interviewers, and reversals in the direction of mean differences were common.

The question remains as to the implications of these findings for interview techniques. Clearly, this study has dealt with the assessment of the concomitants of decision making rather than with manipulable antecedents, such that any interpretation must be considered speculative, pending confirmation by experimental manipulation of relevant circumstances. At least two interpretations of the results are apparent:

1. Interaction process scores may reflect valid dimensions of job performance such as interpersonal skills. Interinterviewer differences in interaction process scores would correspond to differences in sensitivity to such skills. (A follow-up is now under way in which interaction process scores will be correlated with criterion measures based on the inductees' performance records following 3 years regular Army service.)


2. Depending upon whether interaction process scores are correlated with documentary and other descriptive information presented during the interview, these results would throw some light on the possible importance of nonrational processes in decision making. To illustrate this point, let it be assumed that interaction process scores and descriptive information are uncorrelated. Then the fact that interaction process scores are correlated with acceptance-rejection would imply that the interviewer is sensitive to influences which are beyond the realm of data gathering, and may possibly overlook information in reaching his decision, by reacting to a "favourable atmosphere." Obviously, if the existence of such an atmosphere is irrelevant to his task as formerly defined, then the interviewer may be subject to error in his decisions. Particularly if it could be shown that applicant subjects could establish a favorable atmosphere by "role playing" certain interaction categories, these results would serve to demonstrate some of the pitfalls of interviewing technique.

On the other hand, if interaction process scores were found to be correlated with descriptive information, then the results obtained in this study would be of little consequence, since this would imply that interaction scores merely reflected the consequences of favorable or unfavorable information (whichever the case might be) divulged during the interview.

The data presented here cannot settle the issue as to which interpretation is more plausible. One appropriate test of the alternatives would involve having experimental applicant subjects attempt to "role play" or "fake" certain interaction categories in order to bring about favorable decisions on the part of interviewers. It may be worth emphasizing that the data discussed here are entirely consistent with the notion that an applicant can fake his way to a favorable decision. (The data are also consistent with the notion that interviewers respond to such faking in terms of different interaction profiles.) But such interpretations must wait on further investigation.


SUMMARY 


Samples of interview conversation (personnel selection interviews) were analyzed according to Bales' interaction process analysis. Scores obtained were correlated with decisions made by interviewers about whether applicants were recommended for acceptance or rejection.

In Study I, data were obtained from nine interviewers (10 groups of cases ranging from 9 to 15) and scores were submitted to analysis of variance procedure. Some significant differences in interaction process scores were found between decision categories, and some interviewer-decision interactions were also statistically significant.

In Study II, data were obtained from eight Canadian Army personnel officers who interviewed from 13 to 50 applicants for the Army. Correlations between interaction categories and acceptance-rejection were calculated using item analysis and cross-validation procedures.

The results indicated that: (a) Bales' interaction process categories are correlated with acceptance-rejection, the correlation being low but consistent for transformed data; (b) interinterviewer differences in interaction process are confined to interviewer conversation only.

The implications of the results are discussed, in terms of the possible significance of applicant faking of favorable interactions.


REFERENCES

Bales, R. F. Interaction process analysis: A method for the study of small groups. Cambridge: Addison-Wesley, 1950.

Brunswik, E. Perception and the representative design of psychological experiments. (2nd ed.) Berkeley: Univer. California Press, 1956.

Ferguson, G. A. Statistical analysis in psychology and education. New York: McGraw-Hill, 1959.

Guilford, J. P. Psychometric methods. (2nd ed.) New York: McGraw-Hill, 1954.

Gulliksen, H. Theory of mental tests. New York: Wiley, 1950.

Hare, A. P., Borgatta, E. F., & Bales, R. F. (Eds.) Small groups: Studies in social interaction. New York: Knopf, 1955.

Maccoby, Eleanor E., Newcomb, T. M., & Hartley, E. L. (Eds.) Readings in social psychology. (3rd ed.) New York: Holt, 1958.

McNemar, Q. Psychological statistics. (2nd ed.) New York: Wiley, 1955.

Snedecor, G. W. Statistical methods. (5th ed.) Ames: Iowa State College Press, 1956.

Swanson, G. E., Newcomb, T. M., & Hartley, E. L. (Eds.) Readings in social psychology. (2nd ed.) New York: Holt, 1952.

Sydiaha, D. On the equivalence of clinical and statistical methods. J. appl. Psychol., 1959, 43, 395-401.

(Received December 9, 1960)








Journal of Applied Psychology
1961, Vol. 45, No. 6, 402

PROTECTION AGAINST IMPULSE-TYPE INDUSTRIAL NOISE BY UTILIZING THE ACOUSTIC REFLEX¹

JAMES A. CHISMAN AND J. RICHARD SIMON

State University of Iowa


In recent years industrial organizations have 
shown a growing interest in noise control. This 
increased concern with the problem of noise 
is due, in part, to recent court decisions and 
legislation favorable to hearing loss compen- 
sation for workers. Though considerable prog- 
ress has been made in controlling continuous 
noises, little has been accomplished in pro- 
tecting workers from loud impulse-type noises 
such as those produced by drop hammers, 
punch presses, dynamite, etc. The distinguish- 
ing characteristic of impulse noise is its rapid 
rise time, i.e., only a few milliseconds elapse 
between the onset and the peak intensity of 
the noise. Impulse noise is particularly hazardous because its rise time is more rapid than the action of the intra-aural muscles which normally contract to protect the ear against loud sounds (Metz, 1951).


The purpose of the present experiment was 
to investigate the practicability of protecting 
the ear against industrial impulse noise by ex- 
ternally eliciting the acoustic reflex (AR) ac- 
tion of the intra-aural muscles prior to the 
presentation of the noise. The stimulus used 
to activate the acoustic reflex was a pure tone, 
hereafter referred to as an AR tone. 

It has been known for some time that certain tones above approximately 70 db. sound pressure level (SPL) elicit a consensual reflex contraction of the two intra-aural (middle ear) muscles, the tensor tympani and the stapedius. This response, known as the AR, reduces the transmission of sounds through the middle ear and acts to minimize possible cochlear damage from overstimulation by loud sounds (Wiggers, 1937).


1 This article is based on an MS thesis done by Chisman under the direction of Simon. The original thesis is on deposit at the State University of Iowa Library. The authors are indebted to Scott N. Reger for his many helpful suggestions.

Until recently, most investigators who have studied the intra-aural muscles have utilized animal subjects (Wever & Bray, 1937, 1942; Wever & Vernon, 1955; Wiggers, 1937).
However, in recent years, research on human 
subjects has been stimulated by the develop- 
ment of new techniques such as the acoustic 
bridge, and by the possibility of military ap- 
plications. Metz (1951) used the acoustic 
bridge to study the contraction pattern of 
the intra-aural muscles. He found that when 
a 1,000-cycle tone of 100 db. was presented, 
there was a delay of about 35 milliseconds 
before the intra-aural muscles began to con- 
tract and that there was an additional 100- 
millisecond period before the muscles reached 
full contraction. The time for each of these 
phenomena was dependent on the intensity of 
the activating tone, i.e., the more intense the 
sound, the shorter the delay and the faster 
the contraction. It was obvious from these re- 
sults that the AR afforded very little protec- 
tion against noises such as gun blasts which 
have rapid rise times. 

Fletcher and Riopelle (1959) and Fletcher 
(1959) investigated the possibility of protect- 
ing the ear from the noise of machine gunfire 
by externally eliciting the AR action immedi- 
ately before the gun was fired. In Fletcher’s 
experiment, 13 human subjects were exposed 
to 100 rounds of machine gunfire, one round 
at a time, during a 7-minute period. Subjects 
were tested under three conditions: first, with- 
out an AR tone; second, with an AR tone; 
and third, wearing V-51R earplugs. The aver- 
age SPL was 120 db., and the average peak 
SPL was 132 db. For the AR condition, a 1,000-cps tone of 98 db. was presented 200 milliseconds before each shot. A pure tone threshold sensitivity test was performed on each subject prior to the firing and again within 15 seconds after the firing. The difference in db. between the "before" and "after" firing threshold at each frequency was computed. This measure is known as the temporary threshold shift (TTS). Though the




TTS differs in several respects from permanent threshold shift, it is accepted by many as an indication of the permanent hearing loss which would be sustained if the subject were subjected to the same noise over a long period of time (Glorig, 1959; Jerger & Carhart, 1956).

Fletcher's results indicated that protection by the AR tone was superior to that of the earplugs up to and including 1,000 cps but markedly inferior at and above 2,000 cps. The overall mean TTS was 19.23 db. with no protection, 6.27 db. with the AR tone, and 2.50 db. with earplugs. In other words, the AR tone protected the auditory system against machine gunfire, but did not provide as much overall protection as did the earplugs.

... amplifier to a 15-inch Electro-Voice extended range speaker ... enclosure. The im... sufficient to produce the desired TTS ... low the level at which there was a danger of permanent hearing loss. The peak SPL was 139 db., and th... background noise ... The AR ... previous rest... 1955 ... sugges... ction. To reduce the possibility ... tone was
THOD onds before 


signed to determin ind : — 
would be full 
Metz, 1951 


act 


rnally eliciting the 


protect ing the ear 


study, however, 


n had 
rapid rise tim I re, 1c AR tone itself 
have produced i ; n le present 
study, the ... of the AR acti... was increased gradually over a period of 250 milliseconds. This gradual build-up was designed to allow the intra-aural muscles time to contract, thereby preventing the AR tone from producing a TTS.

Fletcher investigated only one frequency of AR tone. To determine the relative effect of frequency on the AR action, two different frequencies, 250 cps and 1,000 cps, were investigated in the present study. Previous research (Simmons, 1959) had shown that frequencies below about 2,000 cps cause the greatest reduction in transmission through the middle ear.
The protection afforded by earplugs was not in 
estigated in this study since extensive research (Har- 
Ch has already been conducted on the 
f es of industrial noise 9 
ntensity transient wave forms are well known 
Apparatus evidence suggests, hi ree 


equipment us 
this 


the purpose. A 
nd a Tektronix oscillo 


in experiment 


4 Williams n echanica rop hammer 


produce the impulse noise (average rise time, 1 milli scope wel 
} ‘ 


second) for this experiment. The hammer produced the hammer 
impacts in pairs, the two impacts in 


was used to Sonogram 5] 


produced by 

speaker. Th 
a pair being spectra produ were 

separat d by approxin ately 370 milliseconds. Here b] nd 

after, the pair of impacts will be referred to as one 


impact. An Ampex 601 tape recorder was used to re Peak SPL was 


n ired with a Tektronix 
cord 100 impacts of the hammer. To pick up the scilloscope in conjunction with a Western Electri 
noise, a Western Electric condenser microphone was condenser microphone. Average 
placed at the normal position for the operator. The noise 

rval between impact W aried 


SPL and background 
were measured with a General 
randomly from level meter 
A preliminary check conducted on five subjects 
lo reproduce the noise, the tape recorder output indicated that the AR tone with the 
fed through a 50-watt MacIntosh _ hi-fidelity rise tin 


lio sound 
to 6 second 
250 millisecon l 
1e did not produce any TTS 








Fig. 1. Mean temporary threshold shift produced by the three experimental treatments.


impact noise plus the 1,000-cps AR tone. The order of administration of treatments was randomized.

At the start of each experimental session, an audiogram for the subject's right ear was obtained. An ADC 53C audiometer calibrated in 5-db. steps was used to determine thresholds for nine frequencies between 250 and 12,000 cps. The subject was then seated with the speaker reproducing the impact noise 1 foot from his right ear and the speaker reproducing the AR tone 1 foot from his left ear. The subject was then exposed to one of the experimental treatments in which 100 impacts were presented during a 10-minute period. Finally, a post-exposure audiogram was obtained for the subject's right ear. The audiometer test was conducted immediately following the 10-minute exposure to the noise. The order of measuring the threshold at each frequency was randomized in order to control the possibility that the TTS might wear off during the audiometer test.⁵ The TTS was determined by comparing the pre-exposure audiogram with the post-exposure audiogram.

The experiment was conducted in a soundproof room. At least 24 hours elapsed between experimental sessions. A comparison of pre-exposure audiograms indicated that this period was sufficient to allow complete recovery from exposure to the noise.
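The TTS measure described above amounts to a per-frequency subtraction of the pre-exposure threshold from the post-exposure threshold. A minimal sketch follows. The list of nine frequencies is inferred from the frequencies named elsewhere in the article, the audiogram values are made up for illustration, and the function names are hypothetical.

```python
# Nine test frequencies (cps) between 250 and 12,000, as inferred from the text.
FREQUENCIES_CPS = [250, 500, 1000, 2000, 3000, 4000, 6000, 8000, 12000]


def threshold_shift(pre_db, post_db):
    """TTS per frequency: post-exposure minus pre-exposure threshold (db.)."""
    return {f: post_db[f] - pre_db[f] for f in FREQUENCIES_CPS}


def mean_shift(shift, exclude=(250,)):
    """Mean TTS over frequencies, leaving out any listed in `exclude`."""
    values = [v for f, v in shift.items() if f not in exclude]
    return sum(values) / len(values)


# Made-up audiograms for one subject, in 5-db. steps.
pre = {f: 10 for f in FREQUENCIES_CPS}
post = dict(zip(FREQUENCIES_CPS, [10, 15, 15, 20, 15, 15, 15, 20, 10]))

tts = threshold_shift(pre, post)
print(tts[2000], mean_shift(tts))
```

The default `exclude=(250,)` mirrors the authors' decision to leave the 250-cps data out of the analysis; it is a convenience of this sketch, not part of the measure itself.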


RESULTS 


Figure 1 shows the mean TTS produced by the three experimental treatments at each frequency measured. Note that the curves for the two AR treatments are below the curve for the No-AR treatment at all frequencies between 500 and 12,000 cps. The largest differences between the AR treatments and the No-AR treatment are at 2,000 and at 8,000 cps. The average TTS is 5.11 db. for the No-AR treatment, 1.44 db. for the 1,000-cps AR tone, and 1.11 db. for the 250-cps tone. Figure 1 also suggests that the impact noise used in this experiment produced a TTS only at frequencies above 250 cps.

5 Subsequent analysis indicated that order of determining postexposure threshold had no significant effect.

Table 1 summarizes the Treatment × Frequency × Subject (T × F × S) analysis of variance used to analyze the data. Since the impact noise did not produce a TTS at 250 cps, the data for this frequency were not included in the analysis.⁶ The significant interaction between Treatments and Frequency (T × F) indicates that the effect of the treatments on the TTS is not independent of the frequency at which the TTS is produced. The main effects of Treatments and Frequencies are also significant, which means that the experimental treatments produced different overall effects, and exposure to the impulse noise produced a different average TTS at the different frequencies.


Inspection of Figure 1 suggests that the curves representing the two AR treatments follow a similar trend. A test of the T × F interaction for the two AR treatments alone (see Table 2) indicates that the trends of the two AR treatments are, in fact, parallel. A test of the Treatment main effect for the two AR treatments is also nonsignificant, indicating that the trends are not significantly displaced from one another. It is therefore concluded that the two AR treatments had the same overall effect on the subjects used in this experiment. It also follows from these results that the trend of the No-AR treatment is not parallel to the trend of either AR treatment.

6 The audiometer test was purposely carried out beyond the point at which a TTS might reasonably be expected to occur. This was done in order to delimit more adequately the range within which the TTS occurred. None of the treatments produced a TTS at 250 cps. On a priori grounds, therefore, the data at 250 cps were not included in the analysis of the difference between the experimental treatments. A decision to discard data based on anything other than a priori grounds would obviously modify the probability of the Type I error.

TABLE 1
ANALYSIS OF VARIANCE ... THREE TREATMENTS
[Table body not recoverable. Surviving entries: Treatments; Frequencies (F); 480.8; 320.8, 14; 392.5; 187.9, 18; 193.8, 126.]


Since the two AR treatments did not differ, their TTS for each frequency were averaged and compared to the TTS of the No-AR treatment at each frequency. Table 3 summarizes the results of these analyses of variance. At five of the frequencies, when the AR tone was present, the TTS was significantly less than that observed for the No-AR treatment. These frequencies are 2,000, 3,000, 4,000, 6,000, and 8,000 cps. There was no difference between the AR and No-AR treatments at three of the frequencies: 500, 1,000, and 12,000 cps. However, the difference between the treatments at 500 cps was just short of being significant.

At the end of the experiment, the subjects 
were asked which experimental treatment was 
more pleasing. The unanimous response was 
that the 250-cps AR tone was the most pleas- 
ing and comfortable. 


TABLE 2
SUMMARY OF ANALYSIS OF VARIANCE FOR AR TREATMENTS
[Table body not recoverable.]

TABLE 3
SUMMARY OF RESULTS OF ANALYSES OF VARIANCE AT EACH FREQUENCY
[Rows: 500, 1,000, 2,000, 3,000, 4,000, 6,000, 8,000, and 12,000 cps. Cell values not recoverable.]


DISCUSSION 


Results of this study indicate that externally eliciting the acoustic reflex is an effective means of protecting the auditory system against an industrial impact noise. Over a wide range of frequencies, the temporary threshold shift was significantly less when an AR tone preceded each impact than when the impact alone was presented. This protection resulted despite the fact that the total acoustic stimulus was greater in the AR treatments than in the No-AR treatment.

No attempt was made in this study to compare the protection provided by an AR tone with the protection afforded by earplugs. The only study in which these two methods of protection have been compared directly indicates that earplugs afford superior protection at frequencies above 1,000 cps (Fletcher, 1959). Earplugs, however, have several serious drawbacks. First, they make normal conversation difficult. Second, they are often uncomfortable and in some cases cause infection. Third, and for the above two reasons, workers often refuse to use earplugs. Finally, in order to be effective, earplugs must be fitted to the individual. It is often difficult to attain and retain the perfect seal which is essential.

Protection by means of an AR tone has several obvious advantages over earplugs. First, AR protection operates only when necessary. Therefore, normal conversation may be carried on after the noise has terminated. Second, the AR tone, particularly the
250-cps tone, does not produce discomfort or
unpleasantness. Third, the AR tone can be 
presented automatically. Therefore, protection 
does not depend on the cooperation of the 
worker. Finally, the AR tone can serve the 
additional function of acting as a pre-impact 
warning device for workers in the area. 

The results of this limited investigation 
when viewed in the light of the many disad- 
vantages of earplugs suggest that activating 
the acoustic reflex may be a practical and ef- 
fective way of protecting industrial operators 
from impact noises. With most industrial ma- 
chines of the impact type, the operator is con- 
fined to a limited area. It would not be 
difficult to install a tone generator near the 
operator’s ear and, by means of an electronic 
timer integrated with the machine, to control 
the time relations between the AR tone and 
the impact. The timer could also be employed 
to cut off the AR tone an instant before each 
impact, thus reducing the amount of energy 
at the moment of impact. The operator would
still be afforded maximum protection since the 
intra-aural muscles remain fully contracted 
for about 250 milliseconds after the AR tone 
has ceased (Metz, 1951). 
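
The timing relations described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `ar_tone_window`, the 500-millisecond lead (the midpoint of the 400 to 600 milliseconds used in the study), the 50-millisecond cutoff gap, and the evenly spaced impacts are all hypothetical choices, not values reported for any installed device. The only constraints taken from the text are that the tone precedes the impact and that it may stop no earlier than about 250 milliseconds before the impact.

```python
def ar_tone_window(impact_ms, lead_ms=500.0, gap_ms=50.0, hold_ms=250.0):
    """Return (tone_on_ms, tone_off_ms) for one impact time.

    lead_ms : how long before the impact the tone starts (assumed value).
    gap_ms  : how long before the impact the tone is cut off (assumed value).
    hold_ms : time the muscles stay contracted after the tone ends
              (about 250 ms according to the text).
    """
    if not 0.0 <= gap_ms <= hold_ms:
        raise ValueError("cutoff gap must not exceed the reflex hold time")
    if lead_ms <= gap_ms:
        raise ValueError("tone must start before it is cut off")
    return impact_ms - lead_ms, impact_ms - gap_ms


# Hypothetical schedule: 100 impacts in 10 minutes, assumed evenly spaced (one every 6 s).
impact_times_ms = [6000.0 * (i + 1) for i in range(100)]
schedule = [ar_tone_window(t) for t in impact_times_ms]
print(schedule[0])
```

Keeping the cutoff gap below the hold time is the design point: the tone adds no energy at the moment of impact, yet the reflex is still active when the impact arrives.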

One final point deserves some comment. The 
overall TTS produced in this study was sub- 
stantially less than that observed in Fletcher’s 
study although the average noise levels re- 
ported in the two studies were similar. There 
are several possible reasons for the differ- 
ing results. First, Fletcher points out that, 
because of the instruments he used, his esti- 
mates of SPL were conservative (Fletcher & 
Riopelle, 1959). The instruments used in the 
present experiment were more appropriate for 
measuring noise with a rapid rise time. Second,
the 139-db. peak SPL reported in the
present experiment is not comparable to the
132-db. average peak SPL reported by
Fletcher. In a later experiment which employed
Fletcher’s setup (Hilding, 1960), the
peak SPL was estimated to be 155-160 db.,
which is considerably higher than the 139-db.
peak in the present study. Third, the noise
spectra used in the two studies were considerably
different. Fletcher’s noise spectrum was
at the high end of the audio range, and the
noise spectrum in the present study was at
the low to middle end. Fourth, Fletcher presented
more impulses per unit of time, i.e., 100
impulses in 7 minutes versus the 100 impulses
in 10 minutes used in the present study.


SUMMARY 


This experiment was designed to investi- 
gate the practicability of externally eliciting 
the acoustic reflex to protect the ear from in- 
dustrial impulse noise. Ten subjects were ex- 
posed to 100 impacts of the recorded noise of 
a mechanical drop hammer during a 10-minute 
period. The noise was presented at a 120-db. 
average SPL. Subjects listened to the same 
noise under three conditions. In Condition 1, 
the impacts were presented alone, i.e., with no 
AR tone presented. In Condition 2, a 250- 
cps, 100-db. tone was presented between 400 
and 600 milliseconds before each impact. 
Condition 3 was the same as Condition 2 except
that a 1,000-cps tone was used. Pre-exposure
audiograms were compared with postexposure
audiograms to determine the
temporary threshold shift (TTS) under each
condition.


Analysis of the data indicated a significant 
difference in TTS between the No-AR con- 
dition and the two AR conditions at the fol- 
lowing frequencies: 2,000, 3,000, 4,000, 6,000, 
and 8,000 cps. No differences were observed at 
250, 500, 1,000, and 12,000 cps. In terms 
of overall protection, the 250-cps AR tone did 
not differ from the 1,000-cps AR tone al- 
though subjects found the 250-cps tone to be 
more pleasing. 
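
As a minimal sketch of the measure used here, the temporary threshold shift at each test frequency is the postexposure threshold minus the pre-exposure threshold. The function name and the threshold values below are hypothetical and serve only to show the arithmetic; they are not data from the experiment.

```python
def temporary_threshold_shift(pre_db, post_db):
    """Return TTS in db. per frequency: postexposure minus pre-exposure threshold.

    Both arguments map a test frequency in cps to a hearing threshold in db.
    """
    if pre_db.keys() != post_db.keys():
        raise ValueError("both audiograms must cover the same frequencies")
    return {freq: post_db[freq] - pre_db[freq] for freq in pre_db}


# Hypothetical audiograms for one subject under one condition.
pre = {2000: 5.0, 4000: 10.0, 8000: 15.0}
post = {2000: 12.0, 4000: 25.0, 8000: 20.0}
tts = temporary_threshold_shift(pre, post)
print(tts)
```

A smaller shift under an AR condition than under the No-AR condition at a given frequency is what the analysis of variance evaluates.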

Results are interpreted to suggest that 
eliciting the acoustic reflex prior to an impact
may be an effective means of protecting the 
ear against industrial impulse noises. The ad- 
vantages of AR protection over protection 
provided by earplugs are discussed. 


REFERENCES 


Fletcher, J. L. Comparison of attenuation characteristics
of the acoustic reflex and the V-51R earplug.
USA Med. Res. Lab. Rep., 1959, No. 397.

Fletcher, J. L., & Riopelle, A. J. The protective
effect of acoustic reflex for impulsive noises. USA
Med. Res. Lab. Rep., 1959, No. 396.

Glorig, A. Current research in industrial noise.
Noise Control, 1959, 5(1), 32-35, 74.

Harris, C. M. (Ed.) Handbook of noise control.
New York: McGraw-Hill, 1957.

Hilding, D. A. The intratympanic muscle reflex as a
protective mechanism against loud impulsive noise.
Ann. Otol. Rhinol. Laryngol., 1960, 69, 51-60.

Jerger, J. F., & Carhart, R. Temporary threshold
shift as an index of noise susceptibility. J. Acoust.
Soc. Amer., 1956, 28, 611-613.

Metz, O. Studies on the contraction of the tympanic
muscles as indicated by changes in the impedance
of the ear. Acta otolaryngol., Stockh., 1951, 39,
397-405.

Simmons, F. B. Middle ear muscle activity at moderate
sound levels. Ann. Otol. Rhinol. Laryngol.,
1959, 68, 1126-1143.

Wever, E. G., & Bray, C. W. The tensor tympani
muscle and its relation to sound conduction. Ann.
Otol. Rhinol. Laryngol., 1937, 46, 947-961.

Wever, E. G., & Bray, C. W. The stapedius muscle
in relation to sound conduction. J. exp. Psychol.,
1942, 31, 35-43.

Wever, E. G., & Vernon, J. A. The effects of the
tympanic muscle reflexes upon sound transmission.
Acta otolaryngol., Stockh., 1955, 45, 433-439.

Wiggers, H. C. The functions of the intra-aural
muscles. Amer. J. Physiol., 1937, 120, 771-78


(Received December 1 19¢ 








Journal of Applied Psychology
1961, Vol. 45, No. 6, 408-414 


A FACTOR ANALYSIS OF THE CALIFORNIA 
PSYCHOLOGICAL INVENTORY 
JOHN O. CRITES, HAROLD P. BECHTOLDT, LEONARD D. GOODSTEIN,
AND ALFRED B. HEILBRUN, JR.


University of Iowa 


Although developed over a period of years 
by Gough (1957) and based in part upon re- 
search with the Minnesota Multiphasic Per- 
sonality Inventory, the California Psychologi- 
cal Inventory (CPI) is a relatively unstudied 
newcomer to the area of personality measure- 
ment. A multidimensional inventory of nor- 
mal personality characteristics, the CPI con- 
sists of 18 separate scales grouped into four 
areas of “adjustment and development.” The 
first area, which describes aspects of social 
functioning, includes measures of Dominance 
(Do), Capacity for Status (Cs), Sociability 
(Sy), Social Presence (Sp), Self-Acceptance 
(Sa), and Sense of Well-Being (Wb). The 
second area, which concerns emotional and so- 
cial development, is comprised of scales on 
Responsibility (Re), Socialization (So),
Self-Control (Sc), Tolerance (To), Good Impression
(Gi), and Communality (Cm). The third
area, which relates to actual and potential
achievement in educational and occupational
pursuits, contains measures of Achievement
via Conformance (Ac), Achievement via
Independence (Ai), and Intellectual Efficiency
(Ie). The final area, which assesses intellectual
and interest modes, has measures of
Psychological-Mindedness (Py), Flexibility (Fx),
and Femininity (Fe). The various CPI scales
were grouped into these four areas in order to
“emphasize some of the psychological and
psychometric clustering which exist among
them” (Gough, 1957, p. 7). The groupings
were not based upon factor analysis, however,
but rather upon inspection of intercorrelations
among scales.

PROBLEM 


In this study factor analysis of the CPI
scale intercorrelations was used to accomplish
a number of purposes. The major intent
of the investigation was to test Gough’s
subjective groupings of CPI scales against the
more objective clusters derived from a factor
analysis. An additional objective was to reduce
the 18 scales of the CPI to a more economical
set which would not only result in
more efficient testing but which would facilitate
pattern analysis in the interpretation of
score configurations on the CPI profile. A final
purpose was to evaluate the extent to which
the CPI assesses different facets of normal
personality functioning.


PROCEDURE

Subjects

A sample of 372 subjects, stratified along two
dimensions, sex and client status, was available for the
factor analysis. With respect to the sex variable,
males and females were treated separately for two
reasons: first, raw scores and hence norms for the
two groups were different (Gough, 1957); and
second, it was thought that the scale interrelationships
for the sexes might differ, since different social
role expectations for men and women probably result
in unique configuration of personality characteristics.
With regard to client status, client and nonclient
groups were selected for independent analysis
because a previous study indicated that they differ
in both kinds and degrees of adjustment problems
(Goodstein, Crites, Heilbrun, & Rempel, 1961). One
group of clients, which was designated “personal”
(P), included all individuals who upon intake at a
university counseling center checked the following
category on an application card: “Would like to discuss
my feelings about myself and others; some personal
problems which are bothering me.” The second
group of clients, which was termed “vocational-educational”
(VE), consisted of persons who at intake
endorsed either or both of these statements: “Would
like to discuss my abilities, interests, aptitudes; my
occupational plans for the future,” and “Would like
to discuss my courses, grades, classes, study techniques;
my progress here at the University.” Following
initial application for counseling, but preceding
interviews with a counselor, the CPI was routinely
administered to all clients. The nonclients, or controls
(C), were members of a sophomore and junior
level psychology course, who were tested with the
CPI during a class period. The total sample of 372
was stratified, then, according to the two sexes and
three categories of client-nonclient status to yield six
groups of 62 subjects each for the factor analysis.


Design 


Scores on the 18 CPI scales for all subjects in the
various groups were first punched on cards for use
with the IBM 650 computer. The next step in the
analysis was to test the homogeneity of the variance-covariance
matrices for the six samples (Anderson,
1958). If these were relatively homogeneous, then the
groups could be combined for the factor analysis
proper. The findings indicated, however, that the
assumption of homogeneity was untenable. To locate
the sources of heterogeneity, the variance-covariance
matrices were compared for the six samples stratified
by sex and client-nonclient status. In these comparisons,
there were no differences between sexes which
were significant at the .01 level, although one was
significant in the C group at the .05 level. There
were significant differences at the .01 level, however,
between the P, VE, and C groups in all possible
combinations. It was decided, therefore, to combine the
male and female data (average within-group correlations)
for the client-nonclient groups and to perform
the factor analyses only on the latter. The remaining
steps in the procedure involved the following: (a) tests
of independence, using the determinants of the population
estimates of the variance-covariance and correlational
matrices (Anderson, 1958); (b) computation of factors
for the three client-nonclient groups following the
complete centroid method (Thurstone, 1947); (c) tests
for the number of significant factors based on
Lawley’s maximum-likelihood procedure (Thomson,
1951); and (d) analytical and graphical rotation of
the factors to an oblique simple structure (Thurstone,
1947).

In the selection of reference tests for each of five
factors, two criteria were used. A set of CPI scales
which had the largest loadings on the factors was
first identified and then selections from the set were
made, one for each factor, to minimize the number
of overlapping items in the various scales. A sixth
reference test was selected to represent a dimension
not defined by the five-factor scales. Thus, each scale
chosen to represent a factor or dimension meets two
requirements: (a) it has a high, although not necessarily
the highest, loading on a factor; and (b) it has
the fewest number of items in common with other scales
which represent other factors. The latter criterion
was necessary because the empirical approach used
by Gough in the construction of most of the CPI
scales resulted in considerable item overlap, a variable
which becomes confounded with the statistical
interrelationships of the scales when they are correlated.
Unless this lack of experimental independence
is eliminated, it is difficult to analyze the common
variance among the scales.

To evaluate the adequacy of the reference CPI
scales which were finally selected, multiple correlations
between the selected set and the remaining
scales were computed. These multiple Rs indicated
the extent to which the selected scales reproduced
performance on the unselected scales and hence were
representative of different scale clusters or groups.
Furthermore, the R²s provided an estimate of the
amount of the total variance accounted for by the
reference scales.
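
The item-overlap criterion described above can be illustrated with a short Python sketch. It assumes that each scale is represented as a set of item numbers; the scale labels A to D and the item numbers are hypothetical and do not correspond to actual CPI scoring keys. The helper names are likewise illustrative.

```python
from itertools import combinations


def overlap_counts(scale_items):
    """Count the items shared by every pair of scales."""
    return {
        (a, b): len(scale_items[a] & scale_items[b])
        for a, b in combinations(sorted(scale_items), 2)
    }


def pick_reference(candidates, chosen, scale_items):
    """Among high-loading candidates for a factor, pick the scale that
    shares the fewest items with the reference scales already chosen."""
    def shared(scale):
        return sum(len(scale_items[scale] & scale_items[c]) for c in chosen)
    return min(candidates, key=shared)


# Hypothetical scoring keys (item numbers only).
keys = {
    "A": {1, 2, 3, 4},
    "B": {3, 4, 5, 6},
    "C": {7, 8, 9},
    "D": {5, 6, 10},
}
pairs = overlap_counts(keys)
best = pick_reference(candidates=["B", "C"], chosen=["D"], scale_items=keys)
print(pairs[("A", "B")], best)
```

Here scale C would be preferred over scale B as a reference scale because it has no items in common with scale D, even if its loading were somewhat lower.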


RESULTS 


The tests of independence for each of the
correlation matrices of the P, VE, and C
groups, as presented in Table 1, were significant.
All of the determinants for the groups
exceeded the expected value of .267 (SE =
.0402) at the .01 level. These findings,
which indicated significant degrees of dependence
in the matrices, supported factorial analyses
of the CPI scale intercorrelations for the
three client-nonclient groups.

TABLE 1
INTERCORRELATIONS OF CALIFORNIA PSYCHOLOGICAL INVENTORY SCALES FOR THE PERSONAL (P), VOCATIONAL-EDUCATIONAL (VE), AND CONTROL (C) GROUPS
[Table body not recoverable from the scan.]

For each of the three groups, the first six
of eight centroid factors were significant according
to Lawley’s (1951) test after iteration
through two cycles by Bargmann’s (1957)
procedure. With 60 df, the χ² values of 216.7,
168.77, and 140.92 for the P, VE, and C
groups, respectively, exceeded the .001 level.
Since the residuals were less than ±.10, it
appeared that the six factors accounted for
most of the common variance of the CPI
scales and that oblique rotations of five or six
factors were justifiable. The first analytical
approximation to a simple structure was poor,
however, probably because many of the scales
were not experimentally independent and
hence formed an unusual number of doublets,
but successive graphical rotations eventually
yielded solutions which were more adequate
and which were quite uniform for the three
groups. The best solution was based upon the
five factors listed in Table 2, although the
communalities (h²) for the six centroid factors
indicated that unaccounted for residual
variance remained. Note that there are some
variations in the factor loadings across groups
but that these are considerably less marked
than the consistencies.

From the factor matrices five scales, which
had the highest factor loadings and fewest
overlapping items, were selected as a reduced
set of CPI variables for a multiple regression
analysis of their relationships to the
remaining scales. The selected set of scales
included the following: Dominance (Do), Good
Impression (Gi), Communality (Cm), Flexibility
(Fx), and Femininity (Fe). With these
five scales as the predictor variables and the
other 13 scales as the criterion variables, beta
weights and Rs were obtained for the P, VE,
and C groups combined (df = 366) since the
variance-covariance matrices for the predictors
were homogeneous. The residual intercorrelations
among the criterion variables in this
analysis revealed that Intellectual Efficiency
(Ie) was highly related to each of them and
moderately related to the five factor scales.
Thus, Intellectual Efficiency was included in
the selected set of CPI scales as a variable
which may account for part of the obliqueness
among the factors as well as the sixth
factor variance. Table 3 lists the beta weights
and R²s for the prediction of each of the 12
unselected scales from the combined 6 reference
scales. The R²s range from .430 to .734
and average .542. The sum of the specific and
common variance predicted by the selected
set accounts for approximately 66% of the
total variance in the 18 CPI scales.
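
For readers who want to see how beta weights and a squared multiple correlation follow from a correlation matrix, the following Python sketch uses the standard relations β = R_xx⁻¹ r_xy and R² = Σ β_j r_jy. It is a generic textbook computation, not a reconstruction of the authors’ IBM 650 program, and the correlations in the example are hypothetical rather than taken from the CPI tables.

```python
def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("predictor correlation matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def beta_and_r2(r_xx, r_xy):
    """Return standardized beta weights and the squared multiple R.

    r_xx : correlations among the predictor (reference) scales.
    r_xy : correlations of each predictor with one criterion scale.
    """
    betas = solve(r_xx, r_xy)
    r2 = sum(b * r for b, r in zip(betas, r_xy))
    return betas, r2


# Hypothetical example with two predictors correlated .50 with each other.
betas, r2 = beta_and_r2([[1.0, 0.5], [0.5, 1.0]], [0.5, 0.5])
print(betas, r2)
```

Repeating the computation once per unselected scale gives one row of beta weights and one R² per criterion, which is the layout Table 3 reports.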


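TABLE 2
ROTATED OBLIQUE FACTOR LOADINGS OF THE CALIFORNIA PSYCHOLOGICAL INVENTORY SCALES FOR THE PERSONAL (P), VOCATIONAL-EDUCATIONAL (VE), AND CONTROL (C) GROUPS
[Column headings: CPI Scales, Groups, Factors. Table body not recoverable from the scan.]


TABLE 3
BETA WEIGHTS AND MULTIPLE Rs FOR THE REFERENCE AND UNSELECTED SCALES OF THE CALIFORNIA PSYCHOLOGICAL INVENTORY BASED UPON THE COMBINED GROUPS (df = 366)
[Table body not recoverable from the scan.]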


DISCUSSION 


The results of the factor analyses both agree 
and disagree with Gough’s clusters of scales 
for the CPI. With one exception, the Domi- 
nance scale clearly represents the Class I 
scales which Gough terms “Measures of Poise, 
Ascendancy, and Self-Assurance” and which 
include the Capacity for Status, Sociability, 
Social Presence, and Self-Acceptance scales. 
The exception is the Well-Being scale, which 
Gough classified with these variables, but 
which has higher empirical relationships with 
the Responsibility, Socialization, Self-Control, 
and Tolerance scales, a group of measures best
defined by the Good Impression scale. Closely
related to this group, but yet distinct from it,
the Communality scale probably belongs with
these Class II “Measures of Socialization, Maturity,
and Responsibility.” In Class III,
“Measures of Achievement Potential and Intellectual
Efficiency,” the Intellectual Efficiency
scale correlates highly with Achievement
via Conformance and Achievement via
Independence, but it also relates well to most
of the other scales in the CPI. The final group
of scales, the Class IV “Measures of Intellectual
and Interest Modes,” breaks down into
quite specific factors, as assessed by the Flexibility
and Femininity scales, which have only
negligible relationships to each other and to
the other measures. Thus, the findings support
two of the four scale clusters or classes which
Gough proposes but offer no evidence for the
existence of the other two.

The six scales which emerge from the factor
and multiple regression analyses provide an
economical but comprehensive coverage and
evaluation of the major aspects of the normal 
personality. Two of the scales, Dominance 
and Good Impression, seem to measure mark- 
edly different modes of adjustment or ways of 
coping with the environment and others. The 
Dominance scale reflects an inclination to 
master and control people and things (Horney, 
1945) and a tendency to direct and manipu- 
late affairs: adjustment comes from changes 
in external reality rather than in the self. In 
contrast, the Good Impression scale defines 
an attitude of compliance, cooperation, and 
conformity (Riesman, 1950): adjustment de- 
rives from adaptations of the self to reality. 
Though distinctive, these two adjustment 
modes appear equally effective, as indicated 
by the positive relationships of the Dominance 
and Good Impression scales to the Intellectual 
Efficiency scale which apparently assesses a 
general level of adjustment variable and which 
corresponds to the over-all elevation of the 
CPI profile. The Femininity and Flexibility 
scales supplement these measures of adjustment
patterns and status with scores on two
highly specific attitudinal and trait dimen- 
sions. Finally, the Communality scale, despite 
its relationship to the Good Impression group, 
contributes unique variance to the reference 
scale set and probably measures the tendency 
to make “modal” responses to the CPI, as 
stated in the manual (Gough, 1957). The 
scales yielded by the factor and correlational 
analyses, then, survey most of the variables 
important in the description and definition of 
the normal personality: adjustment modes, 
adjustment level, attitudes and traits, and 
test-taking set. 

That the various statistical analyses indi- 
cate that a reduced set of six scales accounts 
for about two-thirds of the total variance in 
the CPI measures strongly suggests the use 
of the reference scales for more economical 
testing and more meaningful score interpreta- 
tions. Not only are these scales relatively in- 
dependent statistically, but they are almost 
wholly unrelated experimentally. With the ex- 
ception of the Intellectual Efficiency and 
Dominance scales, which have two items in 
common, there is no item overlap among the 
six scales. As a result, the individual scales 
are more sharply defined, and pattern analyses
of scores on combinations of scales are
more clearly formulated. Since the mean
values of the three groups tend to differ on
these selected scales, further discriminant
function analyses may be of value. The ultimate
meaning and usefulness of the reference
scales depend, of course, upon their relationships
to other, external variables. Research
currently in progress deals with the problem
of validating the reference scales and constitutes
the next phase in this series of studies
on the CPI and the measurement of the normal
personality.


SUMMARY 


The purposes of this study were to test with
factor analytic methods the rational groupings
of the 18 scales of the California Psychological
Inventory (CPI) into four areas or classes
and to determine the number of independent
variables measured by the scales. The sample
consisted of 372 male and female subjects
classified into three groups: personal adjustment
clients, vocational-educational clients
from a university counseling center, and nonclient
controls from an undergraduate psychology
course. The factor analyses, which
were performed on the intercorrelations of the
CPI scales for each group separately (df =
122), yielded a reduced set of six reference
scales which exhibited homogeneous correlation
matrices for all groups. These scales,
which included measures of Dominance, Good 
Impression, Intellectual Efficiency, Flexibil- 
ity, Femininity, and Communality, repre- 
sented group, general, and specific factors 
which corresponded to adjustment modes, ad- 
justment level, personal traits, and test-taking 
attitude. The findings supported some of the 
original scale clusters but not others and in- 
dicated that the six reference scales predict 
most of the reliable variance in the other CPI 
measures. The conclusion drawn from the re- 
sults was that the reference scales provide 
more concise and less ambiguous definitions 
of the variables assessed by the CPI and have 
considerable promise as measures of the nor- 
mal personality. 


REFERENCES 


Anderson, T. W. Introduction to multivariate
statistical analysis. New York: Wiley, 1958.

Bargmann, R. A study of independence and dependence
in multivariate normal analysis. University of North
Carolina, Institute of Statistics, 1957, Ser. No. 186.
(Mimeo)

Goodstein, L. D., Crites, J. O., Heilbrun, A. B.,
Jr., & Rempel, P. P. The use of the California
Psychological Inventory in a university counseling
service. J. counsel. Psychol., 1961, 8, 147-153.

Gough, H. G. California Psychological Inventory
manual. Palo Alto, Calif.: Consulting Psychologists,
1957.

Horney, Karen. Neurosis and human growth. New
York: Norton, 1950.

Lawley, D. N. The maximum likelihood method of
estimating factor loadings. In G. Thomson (Ed.),
The factorial analysis of human ability. (4th ed.)
New York: Houghton Mifflin, 1951.

Riesman, D. The lonely crowd. New Haven: Yale
Univer. Press, 1950.

Thurstone, L. L. Multiple-factor analysis. Chicago:
Univer. Chicago Press, 1947.


(Received December 23, 1960) 





Journal of Applied Psychology
1961, Vol. 45, No. 6, 415-419


CHANCE VERSUS NONCHANCE SCORES ON THE SVIB ¹


RICHARD R. STEPHENSON ²


University of Iowa 


Scores on the Strong Vocational Interest 
Blank (SVIB) are usually reported on either 
of two specially devised profile forms, the 
Hankes Report Form or the IBM Report 
Form. These special profiles (Strong, 1959)


indicate which scores are significant and which are
not, for the . . . [profiles are] printed so that all
nonsignificant [italics added] scores must be plotted
upon a gray background whereas significant scores
will fall upon a white background (p. 7).


The gray background indicates the “chance”
range. The chance ranges were


determined by marking 40 blanks on the basis of
dice throwing and the shaded areas give the middle
68 per cent of the distribution of such chance scores
for each scale. Scores which fall in the shaded areas
may be interpreted as easily obtainable by chance
and therefore indeterminate for the particular scale
in question (p. 7).


While the profile forms may be shaded on 
the basis of the above method, the chance 
means may be more accurately located by 
taking the standard score equivalents of one- 
third of the algebraic sums of the scoring 
weights for each possible response to each 
item for each occupational scale (Strong, 
1959, p. 34). Mathematically exact standard 
deviations for the distribution of chance 
scores have been published by Lyerly (1957). 
Thus, for most occupational scales, it is pos- 
sible to precisely delimit the chance range 
(mean, plus and minus one standard devia- 
tion) in standard score units for each occupa- 
tional scale, which makes more relevant the 
question of their significance. 
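
The computation of a chance range can be sketched as follows, in raw-score units only. The sketch assumes that each item has three possible responses, that a random responder chooses among them with equal probability, and that items are answered independently; under those assumptions the chance mean is the sum over items of one-third of the item’s three weights, and the variance is the sum of the per-item variances. The weights shown are hypothetical, the function names are illustrative, and the variance formula is a general probability result rather than a restatement of Lyerly’s derivation. Conversion to standard scores would additionally require the criterion-group mean and standard deviation, which are not given here.

```python
import math


def chance_mean_raw(item_weights):
    """Expected raw score when each of an item's three responses is equally likely."""
    return sum(sum(weights) / 3.0 for weights in item_weights)


def chance_sd_raw(item_weights):
    """Standard deviation of the raw score under equally likely, independent responses."""
    variance = 0.0
    for weights in item_weights:
        mean = sum(weights) / 3.0
        variance += sum((w - mean) ** 2 for w in weights) / 3.0
    return math.sqrt(variance)


def chance_range_raw(item_weights):
    """Chance range: mean plus and minus one standard deviation."""
    mean = chance_mean_raw(item_weights)
    sd = chance_sd_raw(item_weights)
    return mean - sd, mean + sd


# Hypothetical scoring weights (Like, Indifferent, Dislike) for a three-item scale.
weights = [(1, 0, -1), (2, 0, -2), (3, 0, 0)]
print(chance_mean_raw(weights), chance_sd_raw(weights), chance_range_raw(weights))
```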


¹ Based in part upon a paper presented at the
1961 American Personnel and Guidance Association
convention in Denver, Colorado.

² The writer wishes to express his gratitude to
Leslie A. King of the General College of the University
of Minnesota and Mabel K. Powers of the
Senior Division, College of Science, Literature, and
the Arts, of the University of Minnesota, for making
available their data without which the present research
would not have been possible; and to Richard
L. Walker, Research Assistant, for his assistance in
the tedious task of retabulating the above data.


If, as Strong indicates, scores which fall in 
the chance range are nonsignificant, then it be- 
comes important to assess how much of one’s 
data is “nonsignificant” for a given counsel- 
ing client and to compare this result with 
relevant normative data. It becomes even 
more important to have some estimate of the 
amount of change to be anticipated in the 
event that retest is decided upon in the hopes 
of obtaining a “significant” finding. Finally, 
and perhaps most important, if a given SVIB 
profile is composed of two different “kinds’’ 
of data, chance and nonchance, then it may 
be necessary to re-evaluate all of the research 
findings on SVIB stability. The implication 
in all of these questions is that if a score falls 
in the chance zone because of “chance” factors
(factors that are unknown or random),
then, on retest, such scores might shift to any 
other location on the given occupational scale. 
Nonchance scores, on the other hand, ought 
to vary on retest only within the limits of the 
standard error of measurement of the scale in- 
volved. 


MrTHOD 


While the above considerations form the gener 
background for the present study, the 
petus was found in a study (Stephenson, 1961) don 
on high ability, male, arts college To ol 
tain SVIB letter grade normative data for this group 
such letter grades were tabulated under the Cate 
gories A, B+, B, B and C+, C, and Chance. Thx 
B and C+ letter combined into 
single category as a result of 
that falling in these 
“neutral and ... can very ignored’ 
435). The Chance used to denote 
score falling in the chance area. The 
exclude these nonsignificant, chance, scoré 
However, there 
Chance category than in any other 
These data provided the nucleus of 
data. 

Next, SVIB 

1956) and by 


speci im 


freshmen 


grades were 
Strong’s (1943 
servation scores categories ar 
largely be 
category was 
intent 
analysis 


were more cores 


Power 


Power 


data, made available 
King (1957 
sample consisted of 109 males tested in 1931 and 
again in 1941. Most of these men were in nonpro 
fessional noncollege work, and all had a history of 
unemployment sometime 


), were inspected 


during this 10-year span 


$15 








116 


King’s 
college freshmen, who were tested in 1954 and again, 
approximately 9 months later, in 1955. Results of 
the first testing of both Powers’ and King’s samples 
were tabulated separately according to the SVIB let- 
ter grade categories described for Stephenson. These 
data provided additional chance score normative data 

Finally, 100 SVIB answer sheets were scored by use of the Rand Corporation’s (1955) table of random numbers, a million digit series. This series is large enough that no digit series was used more than once. The first 9 digits were used to answer the Like, Indifferent, Dislike questions, and all 10 digits the Most Liked-Least Liked response sections. These data complete the normative study.

ample consisted of 242, generally low-ability

To assess the relative stability of chance versus nonchance scores, test-retest data were necessary. As indicated, these were secured from Powers and King. Each test score (initial testing) for each subject in both samples was tabulated on each of 44 occupational scales of the SVIB, using the following procedure: Each score was first dichotomized as being in the chance range, using algebraically derived means, plus and minus mathematically exact standard deviations, or out of the chance range. This standard score was tabulated. The retest tabulations, then, were simply in terms of SVIB standard scores, and were computed for each subject and for each scale. This procedure provides an answer to the question: Do scores that have fallen in chance areas upon first testing differ more at retest than do scores that have originally fallen outside the chance area?

The rationale for the above question is quite simple: If chance is chance, or indifference, or anything but measurement, then there is no prediction as to where such scores will fall upon retest. On the other hand, for scores that originally fall outside the chance area, retest deviations should reflect only predictable errors of measurement. Nonchance scores, then, provide the criterion against which to answer the above research question.

The absolute deviation was chosen as the appropriate measure since, under the assumption, both chance and nonchance scores will be as likely to vary in one direction as in the other, and an algebraic measure would, therefore, average to zero.
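The reason for preferring an absolute to an algebraic measure can be illustrated with a small sketch. The scores below are hypothetical and are not taken from the King or Powers data; the function names are likewise illustrative only.

```python
def mean_algebraic_deviation(test, retest):
    """Mean signed (retest minus test) difference across subjects."""
    diffs = [r - t for t, r in zip(test, retest)]
    return sum(diffs) / len(diffs)


def mean_absolute_deviation(test, retest):
    """Mean test-retest difference regardless of direction."""
    diffs = [abs(r - t) for t, r in zip(test, retest)]
    return sum(diffs) / len(diffs)


# Hypothetical standard scores for six subjects on a single scale.
first_testing = [20, 25, 30, 35, 40, 45]
retesting = [24, 21, 33, 32, 45, 40]

signed = mean_algebraic_deviation(first_testing, retesting)
unsigned = mean_absolute_deviation(first_testing, retesting)
```

In this made-up case the upward and downward shifts cancel in the signed mean, while the unsigned mean still reflects how far scores moved.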


FINDINGS 


For greater ease in making visual compari- 
son, SVIB letter grade normative data for all 
normative samples have been converted to 
percentages and are presented in Table 1. As 
can be seen in Table 1, the best guess of a 
subject’s score for any scale is chance. Even 
considering that Stephenson’s sample consists 
of high ability college males, that King’s con- 
sists primarily of low ability college males, 
and that Powers’ consists primarily of noncol- 
lege males, it seems that chance as a modal 
score is a common phenomenon. 

A common distributions chi square test of 
the three male samples in Table 1 resulted in 
a chi square of 173.104, which, with 10 df, is 
significant far beyond the .001 level. Thus, it 
appears that, in groups that differ widely in 
scholastic aptitude, significant differences are 
to be expected in relative proportions found 
within each SVIB letter grade category, in- 
cluding the category Chance. However, the 
rank-difference correlation (rho) between 
King’s sample and Powers’ sample is 1.0 and 
between either of these and Stephenson is .94. 
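A minimal sketch of a common-distribution (homogeneity) chi square computation follows. The counts are hypothetical, and the use of six score categories is an assumption: it is chosen only because three samples by six categories gives (3 - 1)(6 - 1) = 10 df, which agrees with the df reported above.

```python
def chi_square_homogeneity(table):
    """Return (chi square, df) for a samples-by-categories table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if all samples shared one common distribution.
            expected = row_totals[i] * col_totals[j] / grand_total
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(col_totals) - 1)
    return chi2, df


# Hypothetical counts: three samples (rows) by six score categories (columns).
hypothetical_counts = [
    [400, 100, 100, 100, 150, 150],
    [450, 80, 90, 110, 140, 130],
    [500, 60, 80, 100, 130, 130],
]
chi2, df = chi_square_homogeneity(hypothetical_counts)
```

A large chi square with an almost perfect rank-order agreement, as reported above, is not contradictory: the first is sensitive to the size of the proportions, the second only to their ordering.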


Table 2 summarizes the findings when the retest absolute deviations are compared over SVIB occupational scales, and may be interpreted as indicating that, while average scale deviations are significantly larger after 10 years than after 9 months, for a given period of time there is no significant difference between scores that fall in chance on a particular scale and scores that fall out of chance on that scale.

TABLE 1

PERCENTAGES OF SCORES IN THE CHANCE ZONE OR IN ONE OF THE VARIOUS LETTER GRADE RATINGS ON 44-SCALE PROFILES FOR THREE DIFFERENT SAMPLES OF MALE PERSONS

[Table body not recoverable. Surviving column headings: Blanks scored with table of random numbers (N = 100); College high ability males (N = 239); College low ability males (N = 242). Surviving row label: Chance.]

TABLE 2

SUMMARY OF FINDINGS WHEN AVERAGE ABSOLUTE DEVIATIONS ARE COMPARED OVER SVIB OCCUPATIONAL SCALES

[Table body not recoverable. Surviving column headings: Low ability college subjects (N = 242), In chance and Not-in-chance. Surviving row labels: N of scales; N of differences; Most variable scale's average absolute deviation; Least variable scale's average absolute deviation; Total scales' average absolute deviations; Average absolute deviation per scale; Standard deviation ... averages. Surviving footnote fragments: social science high school teacher; director.]

a Data from King (1957).
b Data from Powers (1956).

It should be noted that Table 2 summarizes findings based on four distributions of 44 scales each. Each scale average is based upon the N that happened to fall in or out of the chance range on that particular scale. These Ns vary from 7 (Powers: in-chance, Carpenter) to 206 (King: not-in-chance, Author-Journalist, and Forest Service Man). Further, the occupational scales themselves have differing reliabilities. Thus, there may be significant individual differences in chance and nonchance variability that are masked by collapsing the data over scales. The question that now must be answered is this: Are there significant differences between the in-chance variability and the not-in-chance variability of these individuals over these 44 occupational scales? The data to answer this question are presented in summary fashion as Table 3.

Table 3 may be interpreted as indicating that individuals are more variable after 10 years than after 9 months, but that after this long a period they are no more variable on their originally in-chance scales than on their originally not-in-chance scales, which is the same conclusion reached when the data are compared over scales. Note, however, that in Table 3 there is a significant mean difference between the in-chance and the not-in-chance variability of the two groups in King’s sample. In explaining this significant difference, at least two thoughts come immediately to mind: First is that the absolute magnitude of the significant difference is less than one standard score point. This small size questions the practical significance of the statistical finding. Second, the direction of the difference is opposite to that predicted by the hypothesis under investigation: that in-chance scores are the more variable. Thus, though new hypotheses may be generated by this finding, the general conclusions of the present investigation, that in-chance scores are not more variable than not-in-chance scores, cannot be altered. (The negligible differences in means between Tables 2 and 3 are the result of grouping errors in Table 3 and rounding errors in both tables.)

TABLE 3

SUMMARY OF FINDINGS WHEN AVERAGE ABSOLUTE DEVIATIONS ARE COMPARED OVER PERSONS

[Table body not recoverable. Surviving column headings: Low ability college subjects (a) and Noncollege subjects (b), each divided into In chance and Not-in-chance. Surviving row labels: N of persons; N of scales; Absolute deviations of most variable subject; Absolute deviations of least variable subject; Total group's absolute deviations; Average deviation per subject; Standard deviation of distribution of subjects.]

a Data from King (1957).
b Data from Powers (1956).


DISCUSSION 


The most obvious fact to emerge from analysis of these data is the evidence, perhaps not needed by this time, of the high test-retest reliability of the SVIB. The largest mean difference observed is but seven standard score points, and the variabilities of all distributions are quite small. Related to this relatively inconsequential test-retest variability is the obvious similarity of the in-chance and not-in-chance groups in the two samples, observed differences in most cases being in the range of one or two standard score points. These data, then, provide no basis upon which to question the many findings on high test-retest reliability of the instrument in question, on the grounds that SVIB profiles contain two different “kinds” of data.

Thus, while a given range on the raw score scale of any occupational scale may be interpreted as “nonsignificant,” it does not seem reasonable to make this interpretation on the basis that the score arrived at that point through “chance” factors. The finding that the modal score for these subjects is a “chance” score is probably an artifact of the fact that most subjects do not have interests similar to the interests of a specific occupational criterion group.

The implications of these findings are at least threefold. Most important is the problem of how to interpret “gray areas” to, with, or about counseling clients. This is, basically, the question: Why have gray areas on profile
sheets? While there may be some utility in 
calling attention to the zero raw score point 
on each scale (which point approximates the 
chance mean), there would seem to be more 
value in omitting this shading altogether. 
While such an omission ignores the “paradox” 
that some men in an occupational criterion 
group have negative scores on their own scales 
(Strong, 1943, p. 85), the advantage gained 
is that all scores would be interpretable in the 
same manner, with the interpretation empha- 
sizing the empirical derivation of the scale 
concerned. Also, there is error at all points 
on all scales. Thus, there seems to be no spe- 
cial advantage in calling attention to one spe- 
cial kind of error. 

The second implication concerns the role of chance in pattern analysis of the SVIB. Pattern analysis by most methods, for example, Darley and Hagenah (1955) or Stephenson (1960), takes cognizance of the chance area as chance. If it develops that chance is not chance, then pattern analysis thinking must be revised to either incorporate the zero raw score concept or to utilize letter grade concepts only.

The final implication is that it is conceptually not at all satisfying, in considering vocational development theory, occupational choice theory, or vocational counseling technique, to think in terms of chance, which cannot be manipulated (by definition) and cannot be replicated (see the random numbers column of Table 1). This is especially disturbing in view of the amount of one’s interest test data that must be discarded if one considers gray area chance as being in fact chance (a step that most investigators apparently do not take in practice).


SUMMARY

Strong Vocational Interest Blank (SVIB) letter grade normative data are presented for three male groups with scores falling in the “chance” area considered as a separate letter grade category. While the proportions of scores in the chance area differ for the three groups, in all groups such proportions are the highest of any letter grade category. Using the test-retest data of King (1957) and Powers (1956), each subject’s score on each of 44 SVIB occupational scales is tabulated as “in-chance” or “not-in-chance” on test. Retest deviations were computed in standard score units regardless of direction. These absolute average deviations were compared between King’s two groups and between Powers’ two groups. Comparisons were made over scales and over persons. All results were interpreted as implying that in-chance scores, being no more variable than not-in-chance scores, should probably not be considered as resulting from chance, as the term is normally used. Other implications of these findings were briefly discussed.


REFERENCES 


, 3: te, & Ba 
measurement. Mi 
ress, 1955 
Kino, L. A. Stability 

Interest Blank profil 

143-147 
Lyerty, S. B 

tional Inter 

1957, 41, 14 
Powers, Ma 

tional interes 

1956, 40, 69-72 


CorRPORATI 


RAND 


, 
1 5 

STEPHENS 
Vocatio 
chologi 4 1 160 

STEPHENSON, R 
ability n 
J., 1961, 39 








Journal of Applied Psychology
1961, Vol. 45, No. 6


VISUAL TARGET LOCATION AS A FUNCTION OF NUMBER
AND KIND OF COMPETING SIGNALS 1


DAVID M. PROMISEL 2
Johns Hopkins University


A common problem in the design of man-machine systems is that of presenting information to the man quickly and accurately. The problem may be to present one informational variable, i.e., information that varies along some single continuum, or several informational variables. One method of doing this consists of setting up a relationship between the values, or continua, of the informational variables and the variable aspects of a display. Visual displays are most useful for this purpose because of the number of ways in which a visual signal can vary. For example, the signals on a PPI are recognized as small lighted areas on the tube face. Informational variables can be presented to a man by assigning meaning to, or coding, the variable dimensions of these signals to represent the information.

There are several general methods of accomplishing this assignment. One method, which can be called the nonredundant method, is to relate an informational variable directly to some one variable dimension of the signal, e.g., the varying brightness of the above “small lighted area on the tube face” might be made to signify the size of the object the signal represents. Here brightness is the visual dimension of the signal, and size of the object is the informational variable. Theoretically, as many informational variables can be coded as there are variable dimensions of the signals. A large red lighted area might represent a large object at an altitude of 20,000 feet, a small blue lighted area might represent a small object at 15,000 feet, etc.

1 This research was done under Contract Nonr 248(55) between the Office of Naval Research and Johns Hopkins University. This is Report No. 15 under that contract. Reproduction in whole or in part is permitted for any purpose of the United States Government. The author wishes to express his appreciation to A. Chapanis and W. R. Garner for their guidance during this investigation.

2 Now at Bell Telephone Laboratories, Whippany, New Jersey.



Another method is that of redundant cod- 
ing, i.e., to signify an informational variable 
by a combination of signal dimensions each 
of which uniquely, as well as in combination, 
represents the informational variable. The 
value of the informational variable being pre- 
sented can be determined by referring to any 
one of the several signal dimensions used as 
codes, e.g., a large red lighted area would now 
represent only the altitude of the object with 
both large and red signifying 20,000 feet. 

A third technique is to use a partially re- 
dundant coding system. In this case the in- 
formational variable is again signified by a 
combination of dimensions but the value of 
this variable cannot be determined exactly 
from any one of the dimensions, as is true in 
the completely redundant case. At least one 
of the dimensions does uniquely determine 
the value but other dimensions merely limit 
the possible range of the values. For example, 
given a square blue signal representing the 
altitude of an airplane, the square could 
signify that the altitude was less than 20,000 
feet while blue could indicate exactly 17,000 
feet. 

Dimensions that may be used as visual codes include: hue, geometric shape, area, number, length, angular orientation, brightness, flash rate, and stereoscopic depth. Each of these has its limitations in terms of the number of identifiable steps within the dimension, its effect upon operator fatigue and distractability, the interference with other dimensions, and the speed and accuracy of identification of the components, or values, of the dimension. There are also engineering problems involved in presenting any one of these codes.

There are many factors apart from the codes themselves which affect the efficiency of use of a visual coding system. One such factor may be termed “clutter”; the more complex the visual display the harder it is to find any specific signals (Eriksen, 1953, 1954b; French, 1954; Green & Anderson, 1955, 1956; Reese, 1953). “Competition” is another factor which occurs when irrelevant signals on the display have some, but not all, dimensional values identical to the desired signal or target. For example, if large red signals were being searched for, a small red signal would be a “competing” signal. When performance in locating targets is deficient the use of redundant coding may be of value in improving it (Eriksen, 1952, 1953; Green & Anderson, 1955, 1956; Reese, 1953).

Most of the research done in this area has been concerned with the coding of one informational variable at a time and with the use of redundant coding. There has been little work dealing with the multiple coding of signals to present more than one kind of information simultaneously. This experiment was conducted to investigate the effect on operator performance with multicoded visual signals of the following variables: the number of targets searched for, the value of the targets searched for in terms of hue and shape, and the quantity and distribution of the competition on the display.


METHOD


Materials and Apparatus 


The two visual dimensions used as codes in this experiment were hue and shape. Individual values of these dimensions can be discriminated more quickly and accurately than other dimensions, e.g., in general, a given hue can be picked out from other hues more quickly and with fewer mistakes than a given sized object can be picked out from objects of other sizes. As components of redundant codes, hue and shape, again, appear to be [...] dimensions (Eriksen, 1952, 1953, 1954a; Reese, 1953). The forms used as components of the shape dimension were circles, hexagons, diamonds, triangles, crosses, stars, and squares. These were selected from a list of shapes recommended for use in coding (Baker & Grether, 1954). The hues used were (in Munsell notation) R 5/6, YR 5/6, Y 5/6, GY 5/6, G 5/6, BG 5/6, and B 5/6. These represent the areas of most sensitive color vision on the jnd vs. wave length plot. The objects, or signals, consisting of all 49 combinations of the two codes, were cut out of Munsell paper and glued on to white, 3” [...] cards. To keep the size of the objects reasonably constant they were all made just circumscribable by a circle 1” in diameter.

A schematic of the display equipment is shown in 
Figure 1. The display board was 3’ square, flat white, 


and ruled off by black lines into 81 squares, 4” on a 











cated below the l 
metal 


perpendicular to sul )e 


ameter circular 


ounted in a black 


the board was 4 


' t 


arge ignal was 
the display board 
panel which 
Illumination 


white flu 


was pro\ 


' 


orescen 

Each of the discs [...] a different [...]tion to the right [...] symbol [...]. When a [...] an electrical circuit was closed and the appropriate symbol was printed out by the typewriter. When the sliding panel dropped, it tripped a microswitch, starting an electric timer. The timer was stopped by touching the stylus to a disc mounted on the st[...] below the sample target.


Subjects

Twenty male Ss were selected from among the graduate and undergraduate students of Johns Hopkins University. All Ss passed a shortened version of the Ishihara Color Vision Test (Chapanis, 1948). The Ss were paid at an hourly rate and rewards were given for the best three performances.

Procedure

For each trial 40 signal cards and nine blank white cards were placed in random order on the display board. The values of these cards were selected from the 49 possibilities, subject to the constraints imposed by the experimental design. The sample target signal was put in place before the trial and remained there throughout the trial. The Ss were handed the stylus and were told that, when the sliding panel dropped, revealing the display and starting the timer, they were to touch the stylus to the discs under all the signals that matched the sample in both hue and shape. When they thought they had found all such signals they were to touch the stylus to the disc under the sample, thereby stopping the timer. Ss were given two sample trials but were given no instructions as to how to perform the task. The time score was the time between the dropping of the panel and the stopping of the timer. The error score was the number of omission and commission errors, i.e., the number of target signals not discovered and the number of nontarget signals mistaken for target signals. It was explained to the Ss that performance would be measured by ranking the Ss’ total time scores and error scores separately and then summing these two ranks for each S, giving them equal weight. The lowest sum of the ranks would indicate the best performance by an S, for the purpose of awarding the prizes.
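The prize-scoring rule just described can be sketched as follows. The totals are hypothetical, and the treatment of ties (tied Ss share the mean of the ranks they occupy) is an assumption; the text does not say how ties were handled.

```python
def average_ranks(values):
    """Rank values from 1 (lowest); tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1  # mean of the 1-based ranks pos+1 .. end+1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


def performance_scores(total_times, total_errors):
    """Sum of time rank and error rank for each S, equally weighted."""
    time_ranks = average_ranks(total_times)
    error_ranks = average_ranks(total_errors)
    return [t + e for t, e in zip(time_ranks, error_ranks)]


# Hypothetical totals for four Ss (seconds, error counts).
times = [310.0, 295.5, 402.1, 350.0]
errors = [5, 9, 2, 5]
scores = performance_scores(times, errors)
best_subject = scores.index(min(scores))  # lowest rank sum is best
```

Because the two ranks carry equal weight, an S who is fastest but makes the most errors can be placed behind an S who is moderately good on both measures.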


Experimental Variables

The experiment investigated the effects on performance of four independent variables.

Variable VT was the value of the target signal. Five of the 49 hue-shape combinations were used as targets. These were the R circle, the YR hexagon, the GY triangle, the BG star, and the B square.

Variable NT was the number of target signals to be located on the display. These numbers were not known to the Ss and were 0, 2, 4, 6, or 8. As the total number of signals on the board was constant at 40 an increase in the number of target signals always corresponded to a decrease in the number of nontarget signals.

Variable NC was the number (0, 8, 16, 24, or 32) of competing signals on the display. These were signals identical to the target signal in either hue or shape but not both. Variable NC was, of course, independent of Variable NT but an increase in competing, nontarget signals was of necessity confounded with a decrease in noncompeting, nontarget signals (i.e., those signals with neither the same shape nor hue as the target).

Variable DC represented the manner of distribution of the competition between the hue and shape dimensions. Under condition DC1, half the competing signals competed in terms of hue and half in terms of shape. Under condition DC2, all the competition was in the hue dimension.
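The bookkeeping implied by these four variables can be sketched as below. The function name and the dictionary keys are illustrative assumptions, and the numbering of the two DC conditions (1 for the half-and-half split, 2 for all competition in hue) follows the order in which the conditions are described.

```python
TOTAL_SIGNALS = 40  # signal cards on the board for every trial


def display_composition(nt, nc, dc):
    """Split the 40 signals into targets, competing and noncompeting signals."""
    if nt + nc > TOTAL_SIGNALS:
        raise ValueError("targets plus competing signals exceed 40")
    if dc == 1:
        competing_in_hue = nc // 2          # half the competition in hue
        competing_in_shape = nc - competing_in_hue
    elif dc == 2:
        competing_in_hue = nc               # all the competition in hue
        competing_in_shape = 0
    else:
        raise ValueError("dc must be 1 or 2")
    return {
        "targets": nt,
        "competing_in_hue": competing_in_hue,
        "competing_in_shape": competing_in_shape,
        "noncompeting": TOTAL_SIGNALS - nt - nc,
    }


extreme = display_composition(nt=8, nc=32, dc=2)
typical = display_composition(nt=2, nc=16, dc=1)
```

The sketch makes the confounding explicit: with the total fixed at 40, every added target or competing signal removes one noncompeting signal, and at NT = 8 with NC = 32 no noncompeting signals remain.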


Experimental Design

The 20 Ss were randomly assigned to two blocks of 10 Ss each. These two blocks were treated identically. Within a block the Ss were again divided randomly into two groups of five each. One group was given condition DC1, the other condition DC2, but otherwise they, too, were treated identically.

Within each DC condition in each block the five Ss received combinations of the remaining three variables, VT, NT, and NC. Each of these variables had five levels. Every S was given 25 trials, representing a different latin square combination of these three variables. These latin squares were formed in such a way that a design containing the Ss and any two of the three variables also formed a latin square for each level of the third variable. This procedure produces a complete factorial arrangement within the DC conditions except that some of the interaction terms are confounded with subject variances. Comparable Ss in the two blocks, although receiving the same variable combinations, received these in a different random order.
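One arrangement with the property described above can be sketched with modular arithmetic. This is an illustration only and is not necessarily the set of squares actually used; levels are represented by indices 0 to 4 rather than by the real VT, NT, and NC values, and the random ordering of trials is not modeled.

```python
LEVELS = 5  # five Ss per group and five levels of each of VT, NT, NC


def subject_for(vt, nt, nc):
    """Assign each of the 125 level combinations to one of five Ss."""
    return (vt + nt + nc) % LEVELS


def trials_for_subject(s):
    """The 25 (VT, NT, NC) index combinations given to subject s."""
    return [
        (vt, nt, nc)
        for vt in range(LEVELS)
        for nt in range(LEVELS)
        for nc in range(LEVELS)
        if subject_for(vt, nt, nc) == s
    ]


def is_latin_square(square):
    """True if every row and every column holds each symbol exactly once."""
    symbols = set(range(len(square)))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all(set(col) == symbols for col in zip(*square))
    return rows_ok and cols_ok


# For one fixed level of NC, lay out Ss in a VT-by-NT table.
square_at_nc0 = [[subject_for(vt, nt, 0) for nt in range(LEVELS)] for vt in range(LEVELS)]
```

Under this assignment the five Ss together cover all 125 combinations once, which corresponds to the complete factorial arrangement within a DC condition.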


RESULTS

The two dependent variables in this experiment were time scores and error scores. The time scores were subjected to an analysis of variance for a modified random block design with fixed effects. The distribution of these scores showed some mild skewness and heterogeneity but to transform the data promised both to be of little value and to complicate the interpretation of the data. Thus no transformation was made. The results of the analysis showed the VT, NT, NC, and NC x DC terms to be significant.

There were not enough errors, 129 in 500 trials, to justify the use of parametric statistical tests of them. The 25 commission errors were not subjected to any statistical analysis at all. The remainder of the errors, 104, were omission errors, i.e., there were 104 target signals that were not discovered. Using total omission errors for each S as the unit of measure, the difference between DC1 and DC2 was not significant according to the Mann-Whitney U test. Based on two-way tables of the omission error scores for the Variables NT, NC, and VT, and assuming no interactions, only the NT and VT variables are significant using the Friedman two-way analysis of variance.

Variable VT

Figures 2 and 3 show the relative effect on performance of the five different hue-shape combinations used as targets. It should be noted that, apart from the reversal between YR hexagons and B squares and the steeper slope of the omission error curve, target value affects performance in a similar manner whether it is measured in terms of task time or in terms of the number of omission errors. Figure 2 shows that, when all other conditions are averaged out, the R circle results in the quickest task time and the GY triangle the slowest.


These results with regard to target value are consistent, but in themselves they give no indication of the critical factor which made one target value better than another. Since in the targets actually used hue and shape were completely confounded, it is impossible to know whether the R circle was best because it was a circle or because it was red. It is possible, however, to derive some reasonable inferences about which of these two factors is critical by examining some of the subconditions, subconditions which vary in the extent to which each of these two codes was required to be used by S.

Table 1 gives the rank-orders of the time scores for the different target values for [...] subconditions. Under the NC0 condition there are no competing signals on the display. S may identify the targets in terms of their hue or shape or both. The rank-order of the time scores under this subcondition is identical to the overall rank order as shown [...]. Under Subcondition DC1, target identification is dependent on both hue and shape. [...] case, the rank-order of the [...] same. With the DC2 condition [...] must be made in terms of s[...] still helpful to some extent. [...] condition the rank-order of [...]lated with those of Figure 2. Finally, under one condition [...] there are eight targets [...] signals all compete in [...]. Thus with this subcondition [...]tion must be made on [...] alone, and in this case the rank-order is completely uncorrelated with that of Figure 2. It seems reasonable, therefore, to suppose that the rank-order of [...] in Figure 2 is caused primarily by differences in hue and not by differences in shape.

[Figure 2: task time plotted against target value (R circle, YR hexagon, B square, BG star, GY triangle). Surviving caption fragments: "... to locate different hue-shape ... averaged over all experimental ..."]

[Figure 3: omission errors plotted against target value. Caption not recoverable.]

The only difference between the subcondition in which target identification is dependent completely on shape and the other three subconditions is the increasing dependency of the identification on the hue dimension. There is a shift in the rank-order away from that of the purely shape dependent subcondition as the dependency on hue increases. It is possible to verify further this description of the shift. Of the 25 commission errors, 23 occurred under the DC1 condition. Of these, 15 occurred when G stars were mistaken for BG stars and six occurred when G triangles were mistaken for YG triangles. In other words, the confusion of both the BG and YG hues with the G hue caused almost all of the commission errors. Also, Ss commented to the effect that hexagons and circles were relatively difficult to pick out, as were YG and BG. These facts concerning relative performance with the different code values are mirrored in the rank-orders mentioned above. Thus, the characteristics of the hue code used in this experiment override those of the shape code in determining task time.

Whether conclusions regarding the codes used here can be extended to hue-shape codes in general is an open question. The lack of significance of any of the interactions involving the VT variable indicates, however, that the results pertaining to the other independent variables that were obtained with these specific codes can be stated with some generality.

Variable NT

Figures 4 and 5 show task time and omission errors, respectively, as a function of the number of target signals on the display. The task time function is described as a straight line, justified by an analysis of the data into orthogonal polynomial terms that shows only the linear component to be significant. Although no such analysis was possible for the omission error function, it too appears to be linear increasing over at least the first four experimental levels. That the drop in the function when there are eight target signals on the display is meaningful is suggested by the notion that if the value of NT were carried to the extreme where all of the display signals were targets, then there would probably be no omission errors. This drop might indicate the beginning of a decreasing function.

[Figure 4: task time (sec.) plotted against number of targets. Surviving caption fragments: "... as a function of the number of targets ... point is the average for all ..."]

[Figure 5: omission errors plotted against number of targets (0, 2, 4, 6, 8). Surviving caption fragments: "Total ... made as a function of the number of targets to be found."]
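The orthogonal polynomial breakdown referred to in the Results can be sketched for five equally spaced levels using the standard tabled coefficients. The means below are hypothetical, and the sketch computes only the contrast values; the significance tests reported in the text are not reproduced.

```python
# Standard orthogonal polynomial coefficients for five equally spaced levels.
ORTHOGONAL_COEFFICIENTS = {
    "linear": (-2, -1, 0, 1, 2),
    "quadratic": (2, -1, -2, -1, 2),
    "cubic": (-1, 2, 0, -2, 1),
    "quartic": (1, -4, 6, -4, 1),
}


def trend_contrasts(means):
    """Return the value of each trend contrast for five level means."""
    if len(means) != 5:
        raise ValueError("exactly five level means are required")
    return {
        name: sum(c * m for c, m in zip(coefficients, means))
        for name, coefficients in ORTHOGONAL_COEFFICIENTS.items()
    }


# Hypothetical mean task times (sec.) at five equally spaced levels.
hypothetical_means = [20, 30, 40, 50, 60]
contrasts = trend_contrasts(hypothetical_means)
```

For means that lie exactly on a straight line only the linear contrast is nonzero, which is the pattern summarized above as "only the linear component" being significant.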





[Figure 6: task time (sec.) plotted against number of competing signals (0, 8, 16, 24, 32). Caption: Average time to locate targets as a function of the number of competing signals. (Each point is the average for all other experimental conditions. The function for the DC1 condition has been drawn connecting the points because it contains a [...] as well as a linear component.)]

Variables NC and DC

The effect of the number of competing signals on task time is shown in Figure 6 for the overall situation and that occurring when DC1 and DC2 are analyzed separately. The linear functions for the overall condition and the DC2 condition are again justified by the significance of the linear terms whereas the analysis shows a significant quartic as well as linear component for the DC1 condition. This quartic component indicates that there is some essential difference in the task required by the latter condition, a multicode situation, as compared with the former condition, a partially redundant code situation. This task difference does not result in a significant difference in task time between the two conditions, however. This lack of significance is caused by the fact that the proper error term for testing the significance of this difference is a between-subjects term, which was quite large in this experiment. It seems likely that with a larger number of Ss a significant difference would be found.


Overall performance

Ss’ performance was measured in terms of both task time and errors. The data show that where the independent variables affected the two measures, they did so in a similar manner for both of them. This was despite a tendency for the Ss to trade off speed for accuracy and vice versa. The Spearman rank correlation coefficient for the relationship between the total time and error scores for each of the 20 Ss is -0.626. The similarity between the two measures is apparent even though the Ss’ trade-off tendencies serve to mask it out.
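The rank correlation reported above can be illustrated with a short sketch for untied data. The five pairs of totals are hypothetical (the experiment had 20 Ss), and the simple formula used here assumes no tied scores.

```python
def simple_ranks(values):
    """Rank distinct values from 1 (lowest) to n (highest)."""
    if len(set(values)) != len(values):
        raise ValueError("this sketch assumes no tied values")
    ordered = sorted(values)
    return [ordered.index(v) + 1 for v in values]


def spearman_rho(x, y):
    """Spearman rank correlation: 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(simple_ranks(x), simple_ranks(y)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Hypothetical total time (sec.) and total error scores for five Ss.
total_times = [300, 320, 350, 410, 280]
total_errors = [6, 5, 3, 1, 4]
rho = spearman_rho(total_times, total_errors)
```

A negative coefficient of this kind means that Ss with longer total times tended to make fewer errors, which is the speed-accuracy trade-off described in the text.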


DISCUSSION 


Ss’ task in this experiment required them 
to make two kinds of overt and 
covert. The overt that actu- 
ally required in tapping the metal discs, and 
the covert response is the discrimination re- 
quired to decide which discs to tap. The NT 
variable was the only one affecting the overt 


responses, 


response is 


responses, and, in terms of effect on perform- 


ance time, it was by far the most important 
The sum of squares in the analysis of vari- 
ance for this variable was far greater, by a 
factor of 10, than those for the main effects 
of the other variables and amounted to almost 
a third of the total sums of squares. It 
likely that the effect of NT on 


time is due primarily to the linear increase in 


seems 
linear task 
the number of overt responses required as the 
number of target signals increases. The non 
zero intercept of the function is created by 
the necessity for S to scan » display even 


when there are no targets on it. It follows 
that the most substantial savings i 

in this experiment could have been effected 
by designing S’s task of designating targets to 
require fewer and/or quicker overt responses. 


Variables VT, NC, and DC involved only covert, or discriminative, responses from S and had relatively small but nevertheless significant effects on task time. Thus, the NT effect is probably a composite of two functions. One is the effect produced on task time by NT as a result of the overt responses required by it; the other is the result of the covert responses required. Since the overt responses would be far more time consuming than the covert responses, the former would mask out the latter. Perhaps the covert response function for NT is similar in shape to the omission error vs. NT curve, since the number of omission errors is affected only by covert responses and not at all by overt responses.

The NC variable affected only the difficulty of the perceptual discriminations, not the number of them, i.e., the amount of clutter on the display remained constant. For any value of NT, S had to examine a constant number of nontarget signals. But as the value of NC increased, the difficulty of discriminating the target signals from the nontarget signals increased. If the significance of the NC x DC term is to be given meaning, it must be that the difficulty of discrimination is a function not only of the quantity of competing targets, but also of their quality.

A further explanation of this function can be given in terms of the “effective” number of signals to be examined as determined by the manner of distribution of the competition.
In the partially redundant case, DCs, it has 
been shown (Green & Anderson, 1955, 1956) 
that the competing value effectively determines the area of search. S, in some sense,
only examines those signals having the com- 
peting, i.e., partially redundant, value. This 
fact would explain the linear effect obtained 
in DC.s: the larger this effective area, as NC 
increases, the longer the search time. Now, if 
one assumes that for sufficiently large num- 
bers of competing signals one of the two codes 
in DC, approximates a partially redundant 
code and thus determines the effective search 
area, whereas for smaller numbers the num- 
ber of competing hue and shape signals com- 
bined determines that area, it is possible to 
generate a predicted curve with a shape simi- 
lar to the actual curve shown in Figure 6 for 
DC,. The predicted curve is based on the Ss’ 
having to search out effective areas of the fol- 
lowing numbers of signals (averaged over 
NT) as NC increases from 0 to 32: 4, 12, 20, 
16, 20. Thus, at the lower three levels of NC 
the effective area to be searched by the S in- 
cludes the targets (an average number of 
four) and all the competing signals, in both 
hue and shape. For example, at the second 
level of NC there is an average of four targets on the display, four signals competing in
hue and four competing in shape, producing 
an effective search area of 12 signals. At the 
upper two levels, the effective area of search 
would include, again, the targets plus only 
those signals which compete in one dimension 
(perhaps the signals that compete in hue). 
At the highest level, for instance, there are 
the four targets, 16 competing hue signals 
and 16 competing shape signals. It is hy- 
pothesized, though, that S, in effect, picks 
one of the target values (e.g., target hue) and 
then limits his area of search to only those 
signals having that value. 
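The arithmetic behind the predicted curve can be sketched as follows. This is only an illustration of the hypothesis as stated, under assumptions that are not spelled out in the text: the five NC levels are taken to be 0, 8, 16, 24, and 32 (inferred from the two levels the text does describe), competition is assumed to be split evenly between hue and shape, and the function name and the `switch_level` parameter are hypothetical.

```python
def effective_search_area(nc, mean_targets=4, switch_level=24):
    """Predicted number of signals S must examine at a given NC level.

    Below switch_level, S is assumed to search the targets plus every
    competing signal (hue and shape). At or above it, S is assumed to
    search the targets plus only the signals competing in one dimension.
    """
    per_dimension = nc // 2  # competition split evenly between hue and shape
    if nc < switch_level:
        return mean_targets + 2 * per_dimension
    return mean_targets + per_dimension


# Assumed NC levels; only the endpoints 0 and 32 are stated in the text.
nc_levels = [0, 8, 16, 24, 32]
predicted = [effective_search_area(nc) for nc in nc_levels]
print(predicted)  # [4, 12, 20, 16, 20]
```

The drop from 20 to 16 between the third and fourth levels is what gives the predicted curve its nonmonotonic shape.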


SUMMARY 


Performance in obtaining information from a double coded visual display was investigated as a function of these variables: the number of target signals on the display, the values of the hue-shape combinations used as target signals, the number of competing signals on the display, and the distribution of this competition between the hue and shape dimensions.

The number of targets, the value of the targets, and the number of competing signals all significantly affect task time, with the number of targets having by far the greatest effect. Both task time and errors are greater for some targets than for others, and analysis suggests that hue rather than shape is the critical factor. Both task time and errors are again larger for larger numbers of targets, and task time alone increases with the larger number of competing signals. While there is no difference in task time between locating double coded targets and partially redundant targets, the tasks themselves appear to be different.


REFERENCES 


Baker, C., & Grether, W. Visual presentation of information. USAF WADC tech. Rep., 1954, No. 54-160.

Chapanis, A. A comparative study of five tests of color vision. J. Opt. Soc. Amer., 1948, 38, 626-649.

Conover, D., & Kraft, C. The use of color in coding displays. USAF WADC tech. Rep., 1958, No. 55-471.

Duncan, A. Quality control and industrial statistics. Chicago: Richard D. Irwin, 1959.

Eriksen, C. W. Location of objects in a visual display as a function of the number of dimensions on which the objects differ. J. exp. Psychol., 1952, 44, 56-60.

Eriksen, C. W. Object location in a complex perceptual field. J. exp. Psychol., 1953, 45, 126-132.

Eriksen, C. W. Multidimensional stimulus differences and accuracy of discrimination. USAF WADC tech. Rep., 1954, No. 54-165. (a)

Eriksen, C. W. Partitioning and saturation of the perceptual field and efficiency of visual search. USAF WADC tech. Rep., 1954, No. 54-161. (b)

Fisher, R. A., & Yates, F. Statistical tables for biological, agricultural and medical research. London: Oliver & Boyd, 1953.

French, R. S. Pattern recognition in the presence of visual noise. J. exp. Psychol., 1954, 47, 27-36.

Green, B., & Anderson, L. Size coding in a visual search task. Mass. Inst. Technol. group Rep., 1955, No. 38-17.

Green, B., & Anderson, L. Color coding in a visual search task. J. exp. Psychol., 1956, 51, 19

Humphrey, G. Thinking: An introduction to its experimental psychology. New York: Wiley, 1951.

Reese, E. P., Reese, T. W., Volkmann, J., & Corbin, H. H. Psychophysical research: Summary report: 1946-1952. USN Spec. Dev. Cent. tech. Rep., 1953, No. 131-1-5.

Scheffé, H. The analysis of variance. New York: Wiley, 1959.


Received January 12, 





Journal of Applied Psychology
1961, Vol. 45, No. 6, 428-430


EFFECT OF PERSONAL CHARACTERISTICS ON
RELATIONSHIPS BETWEEN ATTITUDES
AND JOB PERFORMANCE¹


FRANCIS D. HARDING AND ROBERT A. BOTTENBERG

Personnel Laboratory, Lackland Air Force Base


Because the relationship between measurements of attitudes and job performance has generally been found to be low, the nature of research in this area has been undergoing change. One conclusion has been to no longer consider satisfaction and morale indexes as variables intervening between supervisory and organizational characteristics on the one hand and productivity on the other (Kahn, 1960). Another suggestion has been that morale be considered as an output variable, and that attempts be made to delineate the conditions under which morale is related to other group outcomes (Whitlock & Cureton, 1960).

This study investigates the possibility that by giving more consideration to characteristics of workers and jobs, greater relationships between attitudes and productivity will be found. In this regard, it has been shown that workers performing different kinds of work tend to react differently toward management policies (Sayles, 1958). It has also been shown that the amount of work experience of an individual and the organization to which he belongs affect his ratings of the work activities making up his job (Harding & Naurath, 1960).

This report deals with a reanalysis of the data collected by Whitlock and Cureton in their investigation aimed at developing effective means for measuring morale among Air Force personnel. In that study no attempt was made to control for the influence upon an airman’s attitude of his rank, age, length of service, career intention, or type of work performed. The purpose of the present research is to determine the extent to which such biographical data affect the relationships between attitude and job performance. It was hypothesized that personal characteristics such as rank, length of service, career intention, type of work, and months within the organization, when combined with airmen’s scores on attitude scales, would contribute to the prediction of criteria such as job performance and absences from work.

¹ The research reported in this paper was sponsored by Personnel Laboratory, Aeronautical Systems Division, under AFSC Project 7719, Task 17130.


PROCEDURE 

The study involved the application of multiple regression techniques to the data collected by Whitlock and Cureton. Attitude scores and biographical data variables were used to predict criteria of job performance for 376 airmen. First, all predictors were allowed to enter the regression equation. Then, by removing certain groups of predictor variables it was possible to determine their independent contribution over and above that of the variables remaining in the equation.

The variables used in the regression analysis may be classified as follows:


Attitude variables

1. Satisfaction with supervisor: supervisor-subordinate relations
2. Satisfaction with Air Force as a military organization: how well the Air Force is felt to be accomplishing its mission
3. Job satisfaction: interest and pride in job
4. Satisfaction with civilian community: reaction to relations between local community and airmen
5. Satisfaction with Air Force life: overall benefit of being in the Air Force
6. Satisfaction with management and communication: how well informed the airman feels he is
7. Satisfaction with unit: pride in work group and opinion of the group’s leaders
8. General morale: covering everything that is measured by the other scales


Biographical variables

1. Rank of the airmen: this characteristic was represented in the regression analysis by five categorical variables, one each to indicate membership in a particular rank category
2. Career intention: whether or not an airman intended to make a career of Air Force service
3. Length of service: the sample of airmen was grouped into those who had less than four years of service and those who had more service
4. Kind of work performed: the airmen were divided into two groups: those whose work was closely related to flying and maintaining air[...] and those whose work was less involved [...] tional mission
5. Number of months the airmen had [...] unit


Interaction variables

1. Interaction between [...] variables which show the scores of airmen who hold the same [...] five airmen ranks [...] each of the eight attitude scales. The [...] elements [...] interaction variables are the [...] holding the same rank and [...] holding other ranks


Criterion variables

1. Job performance rating: [...] on a six point scale by [...] supervisor
2. Job performance [...]: ranked in respect to overall job performance [...]
3. Estimated absences: [...] airman [...] asked to [...] had [...] sickness [...] previous [...] month period [...]


TABLE 1
MULTIPLE R²s OBTAINED IN PREDICTING THE JOB PERFORMANCE [...]

[Table entries are illegible in the source. Legible predictor rows: All variables; Biographical variables o[...]; Attitude-rank inter[...] variables remove[...]; Linear attitude variable[...].]


RESULTS AND DISCUSSION 


How much of the criterion variance is accounted for by various combinations of predictors is shown in Table 1 by use of multiple R²s. The first row shows the proportion of variance accounted for when all the predictors are allowed to enter the equation. The values represent correlations of .43, .41, and .38, respectively, for the three criteria. The size of the correlations is encouraging and suggests that with additional refinements the variables would provide usable research tools.

An analysis was performed to test whether biographical variables would contribute a statistically significant amount of unique variance when used with a linear combination of the attitude variables. This was shown to be the case for the two job performance criteria. These findings are in keeping with the hypotheses stated above and mean that if you are only using attitude variables to predict job performance, you can improve your prediction by adding biographical variables such as those used in this study.

An additional analysis was made to determine if the attitude variables added to the prediction of job performance beyond that which was predicted by only the biographical variables. For example, in the present study it is possible that an airman’s rank, length of service, career intention, or the type of work performed might account for the variance in job performance to the extent that attitude variables added nothing further. As shown in Table 1, the multiple R² obtained when only biographical variables are used as predictors is .08 for job performance ratings and .09 for the job performance rankings, and for the criterion of estimated absences it is .04. When variables which reflect attitude measures are added to the group of predictors the squared multiples become .18, .16, and .14 for the three criteria. While appreciable, these increases in the multiple R²s are not statistically significant. One possible interpretation is that the increase in predictive efficiency is due to the addition of a very sizeable number of additional predictor variables rather than the addition of independent variance. The failure of the attitude variables to make a significant independent contribution beyond that which is contributed by the biographical variables is an important finding. It suggests that much of the variance which in the past has been attributed to attitudes is held in common by the more easily obtained biographical variables.

The extent to which the attitude by rank interaction variables contribute to the prediction of job performance was also tested by a comparison of the appropriate multiple R²s. For each of the three criteria there was an increase in the proportion of predictable variance when the interaction variables were added to biographical and linear attitude predictors; however, none of the increases was large enough to be statistically significant.
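The comparisons of multiple R²s described above can be sketched with the usual F ratio for nested regression models. The paper does not state which test statistic was used or how many predictors were in each equation, so the formula choice, the function names, and every numeric input below are assumptions chosen for illustration, not values from the study.

```python
import math


def multiple_r(r_squared):
    """Convert a squared multiple correlation to the multiple correlation."""
    return math.sqrt(r_squared)


def increment_f(r2_full, r2_reduced, n, k_full, k_reduced):
    """F ratio for the gain in R^2 when predictors are added to a model.

    n is the sample size; k_full and k_reduced are the numbers of
    predictors in the larger and smaller equations.
    Returns (F, numerator df, denominator df).
    """
    df1 = k_full - k_reduced
    df2 = n - k_full - 1
    f_ratio = ((r2_full - r2_reduced) / df1) / ((1.0 - r2_full) / df2)
    return f_ratio, df1, df2


# Hypothetical inputs: 200 cases, 5 predictors in the reduced equation,
# 15 in the full equation, and R^2 rising from .10 to .12.
f_ratio, df1, df2 = increment_f(0.12, 0.10, n=200, k_full=15, k_reduced=5)
print(round(f_ratio, 3), df1, df2)
print(round(multiple_r(0.16), 2))
```

With these hypothetical inputs the gain in R² is spread over ten added predictors, so the F ratio is well below 1. This is the situation the authors describe when an increase in R² is appreciable in size but not statistically significant.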


CONCLUSIONS 


The suggestion was made that one reason for the lack of relationship usually found between attitudes and job performance is the failure to take into account important biographical and situational variables. It was hypothesized that the inclusion of such variables in multiple regression problems would improve the prediction of job performance.

Comparisons of the multiple R²s obtained when several combinations of variables were used to predict criteria of job performance and absences from the job permit the following statements to be made:


1. The contribution of attitude variables to 
the prediction made by biographical variables 
is not significant. 

2. The contribution of attitude—rank inter- 
action variables to the combination of linear 
attitude variables plus biographical variables 
is not significant. 

3. The contribution of biographical vari- 
ables to a linear combination of attitude vari- 
ables is significant for job performance cri- 
teria but not for the estimated absences cri- 
terion. 

These findings suggest that much of the 
criterion variance associated with attitude 
measures is also related to more easily ob- 
tained biographical characteristics of workers 
and that biographical variables should be con- 
sidered when relating attitudes to produc- 
tivity. 


REFERENCES 


Harding, F. D., & Naurath, D. A. Effects of job experience and organization on the rating of tasks. Engng. industr. Psychol., 1960, 2, 63-68.

Kahn, R. L. Productivity and job satisfaction. Personnel Psychol., 1960, 13, 275-287.

Sayles, L. R. Behavior of industrial work groups: Prediction and control. New York: Wiley, 1958.

Whitlock, G. H., & Cureton, E. E. Validation of morale and attitude scales. USAF WADD tech. Rep., 1960, No. TR-60-76.


(Received January 13, 1961)





Journal of Applied Psychology 
1961, Vol. 45, No. 6, 431-434 


THE STABILITY OF GUILFORD-ZIMMERMAN 
PERSONALITY MEASURES 


JAY M. JACKSON 


University of Kansas


In using test scores in the selection of employees, it is desirable that so-called measures of personality remain relatively stable over time. This is the only justification for assuming that they describe attributes of the person, rather than the person’s reaction to his immediate situation. Many persons, for example, will feel somewhat insecure in a new environment with strangers. But some persons will have a greater tendency than others, and will require more supportive and less threatening situations. One would expect such persons to be less secure than the average regardless of situational changes, and their relative insecurity to persist over time.

As part of a larger study¹ of factors related to turnover in two commercial offices of a telephone company, the Guilford-Zimmerman Temperament Survey, a standardized personality inventory originating in factor analysis, was administered to 96 female employees. A number of findings pointed to the relevance for performance of several of the traits measured by the GZTS. It was found, for example, that girls who were relatively high on Sociability and Ascendance had a greater aptitude for the job, and those low on Security, an index constructed by the investigator from items of the inventory (Jackson, 1956), had considerable difficulty making an adjustment to the work situation. In the light of these and other results it was concluded that the GZTS showed promise as a potential management aid in selection of personnel for the job being studied, if it could be concluded that the test was measuring relatively persistent characteristics of persons, rather than reflecting reactions to the test situation, the current life situation, or temporary feeling states.

An examination of the quite extensive literature yields no evidence regarding the question raised: the GZTS has been correlated with intelligence scores (Bass, Wurster, Doll, & Clair, 1953; Psychometric Notes, 1954), college grades (Psychometric Notes, 1954), academic achievement (Bendig & Sprague, 1954; Goedinghaus, 1954), leaderless group discussion scores (Bass, McGehee, Hawkins, Young, & Gebel, 1953; Bass et al., 1953), success in training occupational therapy students (Booth, 1957), personnel assessments (Hilton, Bolin, Parker, Taylor, & Walker, 1954), selection and placement of engineers (Kirkpatrick, 1956), [...] (Pitcher & Stacey, 1954), and other personality tests (Gilbert, 1950; Jones, 1954; Kaess & Witryol, 1957). Norms have been [...] for males vs. females (Eisele & Cottle, 1951; Isaacson & Cottle, 1952), Negroes vs. whites (Johnson & Cottle, 1952), and [...] (Franks, 1956). But there appears to be no published evidence regarding the relative stability of the trait scores over time.

[...] white-collar employees [...] “Service Representatives,” and 24 female [...] districts of a telephone [...] test. [...] sample had been [...] Service Representative [...] and a half [...] subjects were [...] job training had [...] even married, some promoted, [...] For many of the subjects, friends [...] left and new friendships had been [...] supervisors [...] assumed to have undergone change. Thus it was assumed that the psychological situations for most subjects were different for the two administrations of the test.

¹ A number of papers reporting results of this project are in the process of publication. A report made to the company in 1955 (see Jackson, 1955) [...]


The rationale provided the subjects for administering the personality inventory was that the research was attempting to determine whether a particular type of girl was attracted to their job, or whether they were simply a “cross section of American girls.” It was implied that they were successful employees, and that management wanted to be able to select others like them. It appeared that these instructions were successful in reducing the threat for subjects. They were cooperative, relaxed, and the test situation seemed close to ideal.


RESULTS


Scores for the two administrations of the GZTS are reported in Table 1. Since it is possible that length of employment at the time of initial testing might be a factor in whether or not changes occurred, results are presented separately for less experienced and more experienced employees. Means and standard deviations are also presented for the total sample, including supervisors. There is a striking similarity between the 1953 and 1955 mean scores for all scales and for each category of the sample. The difference between each pair of means was evaluated, using the customary t test for matched groups. In no case was it large enough to warrant confidence that any real change had taken place. Significance levels ranged between .70 and 1.00, far from approaching an acceptable level of confidence.

TABLE 1
STABILITY OF GUILFORD-ZIMMERMAN SCORES OVER AN 18-MONTH
PERIOD FOR 49 FEMALE OFFICE WORKERS

                                    Less            More           Total
                                experienced(a)  experienced(b)   sample(c)
Scale                    Year     M     SD        M     SD        M     SD
General Activity         1953   16.9   4.6      19.1   6.9      18.2   6.2
                         1955   15.4   5.5      20.0   6.0      17.9   6.2
Restraint                1953   14.6   5.1      16.8   4.1      16.9   4.8
                         1955   14.3   5.3      17.0   4.4      17.0   5.1
Ascendance               1953   13.0   5.1      13.9   ?.9      14.2   5.0
                         1955   14.2   5.2      14.5   6.1      14.7   5.5
Sociability              1953   22.5   ?.6      21.0   5.4      22.0   5.5
                         1955   22.0   ?.3      ?1.0   6.4      ?1.6   5.8
Emotional Stability      1953   18.0   ?.4      20.2   4.7      20.0   4.7
                         1955   18.6   1.2      20.9   5.4      20.4   5.4
Objectivity              1953   19.4   ?        19.7   3.6      19.6   4.5
                         1955   19.3   ?.1      19.5   4.5      19.6   4.7
Friendliness             1953   16.4   3.6      17.0   6.2      17.7   ?
                         1955   15.6   7.2      15.8   5.0      16.7   1.9
Thoughtfulness           1953   16.9   6.0      17.4   4.1      18.7   6.6
                         1955   16.5   7.2      16.4   4.5      17.3   5.6
Personal Relations       1953   ?0.3   4.4      21.0   5.5      21.6   5.4
                         1955   20.8   ?.8      20.7   5.6      21.6   5.8
Masculinity-Femininity   1953   10.9   3.3      11.6   ?.4      11.4   3.8
                         1955   12.1   3.6      10.4   1.9      11.?   ?.3
Security(d)              1953    7.4   1.1       7.4   2.0       ?.3   1.6
                         1955    7.4   1.4       7.4   1.6       7.4   ?

(a) Subjects with less than one year’s service at initial testing, N = 11.
(b) Subjects with one year or more service at initial testing, N = 3[?].
(c) Includes supervisors, N = 49.
(d) This is a 10-item scale (see Jackson, 1956).
Note. A question mark stands for a digit that is illegible in the source.

Even though mean scores on the test did not change significantly in the 18-month period, it is still possible that some scores increased and others decreased, with a compensatory effect that left the means unchanged. To explore this possibility, Pearson product-moment correlations were computed between the 1953 and 1955 scores. The results of this analysis are presented in Table 2.

TABLE 2
PEARSON PRODUCT-MOMENT CORRELATIONS BETWEEN TWO ADMINISTRATIONS OF THE
GUILFORD-ZIMMERMAN TEMPERAMENT SURVEY, 18 MONTHS APART,
FOR 49 FEMALE OFFICE WORKERS

[Table entries are illegible in the source. Legible column headings: Scale; Less experienced; More experienced. Rows: General Activity, Restraint, Ascendance, Sociability, Emotional Stability, Objectivity, Friendliness, Thoughtfulness, Personal Relations, Masculinity-Femininity, Security.]

Correlations for the total sample and for the more experienced subjects are all so large that they would occur by chance much less often than once in a hundred times. On a few of the scales for the less experienced group, correlations are relatively small, indicating considerable shifting in both directions from the original scores. Considering that there are only 11 persons in this category, this result does not appear to be too serious. Further examination of these scores indicated that the small correlations resulted from large changes in the scores of two subjects who had been married in the interval between the two test administrations. The correlations demonstrate substantial stability of the Guilford-Zimmerman scores, and in most cases are of the same order of magnitude as the reliability figures published in the GZTS manual (Guilford & Zimmerman, 1949).
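The two analyses reported in this section, a t test for matched groups on the means and a product-moment correlation between administrations, can be sketched as below. The six score pairs are invented for illustration and are not taken from Table 1; the function names are likewise hypothetical. The example is built so that the mean does not change while individual scores still shift, which is the case the correlation analysis was meant to examine.

```python
import math
from statistics import mean, stdev


def paired_t(first, second):
    """t statistic and degrees of freedom for matched scores."""
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / math.sqrt(n))
    return t, n - 1


def pearson_r(x, y):
    """Product-moment correlation between two administrations."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical scale scores for six subjects at the two testings.
scores_1953 = [16, 19, 14, 22, 18, 20]
scores_1955 = [15, 20, 14, 21, 19, 20]

t, df = paired_t(scores_1953, scores_1955)
r = pearson_r(scores_1953, scores_1955)
print(t, df, round(r, 3))
```

Here the individual changes cancel, so t is zero, yet the correlation is still informative: a high r shows that subjects kept their relative positions, while a low r would reveal compensating shifts hidden behind the unchanged mean.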


CONCLUSIONS 


The findings from this study indicate that within the limitations of the type and number of subjects involved, and to the degree that the psychological situations were different at the times of the two administrations, the Guilford-Zimmerman scale scores measure relatively persistent attributes of the persons tested. In this sense the test may be considered a personality inventory. It may be that when extreme changes occur in the life situation of a subject, such as marriage for a quite young girl, some personality traits such as Emotional Stability, Femininity, and Security may be markedly affected. On the whole, however, the scores demonstrate considerable stability over time, and high test-retest reliability.


REFERENCES 

Bass, B. M., McGehee, C. R., Hawkins, W. C., Young, P. C., & Gebel, A. S. Personality variables related to leaderless group discussion behavior. J. abnorm. soc. Psychol., 1953, 48, 120-128.

Bass, B. M., Wurster, C. R., Doll, P. A., & Clair, D. J. Situational and personality factors in leadership among sorority women. Psychol. Monogr., 1953, 67(16, Whole No. 366).

Bendig, A. W., & Sprague, J. L. The Guilford-Zimmerman Temperament Survey as a predictor of achievement level and achievement fluctuation in introductory psychology. J. appl. Psychol., 1954, 38, 409-413.

Booth, M. D. A study of relationship between certain personality factors and success in clinical training of occupational therapy students. Amer. J. occup. Ther., 1957, 11, 93

Eisele, M. C., & Cottle, W. C. The Guilford-Zimmerman Temperament Survey: I. With rural high school students. U. Kans. Bull. Educ., 1951, 6, 12-15.

Franks, C. M. Conditioning and personality: A study of normal and neurotic subjects. J. abnorm. soc. Psychol., 1956, 52, 143-150.

Gilbert, C. The Guilford-Zimmerman Temperament Survey and certain related personality traits. J. appl. Psychol., 1950, 34, 394-396.

Goedinghaus, C. H. A study of the relationship between temperament and academic achievement. Unpublished master’s thesis, University of Southern California, 1954.

Guilford, J. P., & Zimmerman, W. S. The Guilford-Zimmerman Temperament Survey: Manual of instructions and interpretations. Beverly Hills, Calif.: Sheridan Supply, 1949.

Hilton, A. C., Bolin, S. F., Parker, J. W., Jr., Taylor, E. K., & Walker, W. B. The validity of personnel assessments by professional psychologists. J. appl. Psychol., 1954, 38, 287-293.

Isaacson, L. E., & Cottle, W. C. The Guilford-Zimmerman Temperament Survey: II. Urban high school students. U. Kans. Bull. Educ., 1952, 6, 46-50.

Jackson, J. M. Summary of conclusions from the commercial department study. Ann Arbor, Mich.: Research Center for Group Dynamics, 1955. (Mimeo)

Jackson, J. M. A Guilford-Zimmerman scale for psychological insecurity. Lawrence, Kan.: Author, 1956. (Ditto)

Johnson, A., & Cottle, W. C. The Guilford-Zimmerman Temperament Survey: III. With urban Negro high school students. U. Kans. Bull. Educ., 1952, 6, 75-80.

Jones, M. B. Aspects of the autonomous personality: IV. Traits from the Guilford-Zimmerman Temperament Survey. USN Sch. Aviat. Med. res. Rep., 1954, No. NM 001058.25.16.

Kaess, W. A., & Witryol, S. L. Positive and negative faking on a forced-choice authoritarian scale. J. appl. Psychol., 1957, 41, 333-339.

Kirkpatrick, J. J. Validation of a test battery for the selection and placement of engineers. Personnel Psychol., 1956, 9, 211-227.

Pitcher, B., & Stacey, C. L. Is Einstellung rigidity a general trait? J. abnorm. soc. Psychol., 1954, 49, 3-6.

Psychometric notes, Number 5. Beverly Hills, Calif.: Sheridan Supply, 1954.


(Received January 27, 1961)








Journal of Applied Psychology
1961, Vol. 45, No. 6, 435-[...]


RELATIONSHIPS BETWEEN INDIVIDUAL PROFICIENCY AND
TEAM PERFORMANCE AND EFFICIENCY


WILLIAM M. WIEST, LYMAN W. PORTER, AND EDWIN E. GHISELLI

University of California, Berkeley


Several studies have been reported comparing individual proficiency in the performance of a task with team proficiency. In general these studies have been addressed to one or another of the following three problems: (a) How does the level of performance of a team compare with the performance expected on the basis of a simple additive combination of proficiency of the individuals composing the team? (b) How well can the performance of a group of individuals working cooperatively on a task be predicted from knowledge of the proficiency of individual members of the team? (c) Is team performance better predicted from the individual performance of the more proficient members of a team or from the less proficient members, and how can the measures of individual proficiency best be combined to predict team performance?

Lorge, Fox, Davitz, and Brenner (1958) cite several studies indicating that the summation of individual contributions in the solving of cognitive problems exceeds the performance which these same individuals are able to achieve when working as a group. Thus in some cases “group interaction may inhibit the fullest potential contribution by its members” (p. 354). Studies by Husband (1940), Marquart (1955), Shaw (1932), Taylor and Faust (1952), and Watson (1928) support this generalization.

The second question has been pursued by Comrey (1953), Comrey and Deskin (1954a, 1954b), and Comrey and Staats (1955) in a series of experiments using both motor and cognitive tasks. With the motor task Comrey found that a surprisingly small proportion of the variance of the team performance was predictable on the basis of the proficiency achieved by the members when performing the same task individually. In all of these experiments using a motor task the proportion of predicted team variance was less than one half. But with a cognitive task Comrey found that considerably more than half of the team variance was predictable on the basis of individual proficiency.

The relative magnitudes of the relationship between team performance and the individual performance of the most proficient member of the team on the one hand, and the relationship between team performance and the individual performance of the least proficient member on the other hand, appear to depend upon the nature of the interaction required or permitted by the group task. When the task requires the members to perform parts of the task alternately, Comrey found that team performance tended to have a higher correlation with the performance of the least proficient member of the team. However, when the group task permits greater flexibility of interaction among members of the team and less structured task assignments, he found team performance to be more highly correlated with the proficiency of the most proficient member of the team.

The research to be reported here is focused primarily on the second and third of the three questions listed above. However, data relevant to the first question also are noted. The study involves two-person teams working on a cognitive task in which interaction among the members in the performance of the team task is allowed but not specified in its type or amount.


task in which 


METHOD 


Nine jigsaw puzzles of approximately equal difficulty were constructed of
[illegible] fiberboard. Each puzzle contained [illegible] pieces and when
assembled formed a 1-foot square. The faces of the puzzles were
[illegible] abstract designs. On the basis of a small standardization
sample the puzzles were ranked in terms of difficulty of solution. Being
ordered in this manner, the puzzles could be presented in increasing order
of difficulty to minimize the effects of practice. The puzzles are
considered to be cognitive tasks because they primarily require correct
judgments or decisions and only a minimal level of manual dexterity. That
is, the puzzles involve essentially problems of decision making rather
than of implementation.

The subjects were 64 pairs of male undergraduates 
drawn from psychology courses. Most pairs 
were composed of men who were total strangers, 
but a number were casual acquaintances and two 
pairs consisted of men who were friends. 

Four puzzles were used to ascertain each subject’s 
level of proficiency on the task while working indi- 
vidually. The puzzles were briefly described to the 
subject, and the pieces were placed face up in random 
order on the table before him. The score for a 
puzzle was the time in seconds required to assemble 
it completely. One subject in each pair was given the 
four puzzles, one at a time, to assemble by himself 
while his partner filled out a personal data question- 
naire. After each subject completed his first indi- 
vidual task he exchanged positions with his partner. 
When both subjects had completed their individual 
tasks they were given the remaining five puzzles, one 
at a time, to assemble as a team. The subjects were 
allowed to talk with one another as much as they 
wished in deciding how best to proceed as a team. 
The behavior and the comments of most of the sub- 
jects indicated that they enjoyed the tasks and were 
highly motivated to perform well. 


RESULTS 

For each individual subject a time score 
was determined by averaging the times re- 
quired by him to complete the first four puz- 
zles. Reliability coefficients were estimated 
from the coefficient of correlation between 
the scores on the odd and even puzzles cor- 
rected by the Spearman-Brown formula. The 
reliability coefficient for the faster subject 
was .85, for the slower .90, and for all sub- 
jects together .93. 
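
The odd-even reliability estimate described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: each subject's four individual puzzle times are taken to be available in presentation order, and the function names and the sample times are hypothetical, not data from the study.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman_brown(r_half, factor=2.0):
    """Step a part-test correlation up to a test `factor` times as long."""
    return factor * r_half / (1.0 + (factor - 1.0) * r_half)


def split_half_reliability(times_by_subject):
    """Odd-even reliability for subjects with four puzzle times each."""
    odd = [(t[0] + t[2]) / 2 for t in times_by_subject]   # puzzles 1 and 3
    even = [(t[1] + t[3]) / 2 for t in times_by_subject]  # puzzles 2 and 4
    return spearman_brown(pearson_r(odd, even))


# Hypothetical times in seconds for five subjects (illustration only).
sample_times = [
    (250, 300, 280, 310),
    (400, 420, 390, 450),
    (200, 260, 240, 230),
    (500, 480, 520, 560),
    (330, 350, 300, 340),
]
reliability = split_half_reliability(sample_times)
```

With a half-test correlation of .50, for example, the Spearman-Brown correction gives 2(.50)/(1 + .50), or about .67.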

The score for each team was computed as the average time required by the
pair of subjects working together to complete the last five puzzles. The
reliability coefficient for the team scores was .88. This was obtained by
the use of the formula given by Horst (1951) for the situation where the
two parts of a test are of unequal length.

An index was computed for each group measuring the extent to which team
performance was superior or inferior to the performance expected on the
basis of an additive combination of individual skills. This index,
designated D(o-p), is the difference between the observed and the
predicted team time, and therefore is a measure of team efficiency. The
predicted team score was arrived at by determining the amount of time it
would have taken the team to finish the puzzles had both members worked at
the same rates they did on the individual tasks. If there were no
interaction between the members of a team, this index, D(o-p), would be
zero: that is, there would be no difference between the team score
predicted on the basis of the subjects’ performance as individuals and
their actual observed team score. A positive value of D(o-p) indicates a
negative or inhibitory interaction between the subjects when working as a
team, meaning that the team took more time than expected. A negative value
of D(o-p) indicates a positive or facilitative interaction between the
subjects, meaning that the team took less time than expected. The mean
value of D(o-p) for the 64 groups was 18.3, which was significantly
different from zero at less than the .001 level. This indicates that on
the present task team performance was poorer than would have been
predicted on the basis of the proficiency demonstrated by the members of
teams when working individually. Only 10 of the 64 pairs achieved a
negative D(o-p) score, that is, did better than expected. Hence with the
present task the team situation produced an interfering kind of
interaction for most pairs of subjects.


TABLE 1

MEANS, STANDARD DEVIATIONS, AND INTERCORRELATIONS AMONG THE VARIOUS SCORES

                                              Team          Team efficiency
              Slow      Difference  Sum       performance   D(o-p)            M        SD

Fast          .284*     -.164       .605***   [illegible]   -.219             283.2     72.79
Slow                     .898***    .936***   .628***       -.146             469.8    164.48
Difference                          .685***   .316**         .049             186.6    159.99
Sum                                           .790***       -.200             753.0    197.96
Team                                                         .296*            189.9     40.72
D(o-p)                                                                         18.3     20.97

  *p < .05.
 **p < .01.
***p < .001.
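
The article says only that the predicted team time assumes both members work at the rates they showed individually. One plausible reading, offered here as an assumption and not as the authors' stated formula, is that the two rates add when the members work on the same puzzle at the same time. Under that reading the index (observed minus predicted team time) could be computed as in the following sketch; the function names and the example times are hypothetical.

```python
def predicted_team_time(fast_time, slow_time):
    """Predicted seconds per puzzle if the two individual rates simply add.

    ASSUMPTION: a member who needs t seconds per puzzle works at a rate of
    1/t puzzles per second, and the team rate is the sum of the two rates.
    """
    combined_rate = 1.0 / fast_time + 1.0 / slow_time
    return 1.0 / combined_rate


def team_efficiency(observed_team_time, fast_time, slow_time):
    """Observed minus predicted team time.

    Positive values mean the team was slower than predicted (inhibitory
    interaction); negative values mean it was faster (facilitative).
    """
    return observed_team_time - predicted_team_time(fast_time, slow_time)


# Hypothetical pair: 300 s and 600 s individually, 230 s as a team.
d_index = team_efficiency(230.0, fast_time=300.0, slow_time=600.0)
```

For this hypothetical pair the predicted team time is 200 seconds, so the index is 30 seconds: the pair took longer together than the additive-rate model predicts.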

Members of each team were designated as 
“fast” or “slow” in terms of the speed with 
which they performed their individual tasks. 
Table 1 shows that the correlations between 
the individual times of both types of subjects 
and team performance are quite high, the cor- 
relation in the case of the fast subjects being 
higher than that of the slow subjects. 

The time scores of the two members of each team were added together to
yield a “sum” score, and the fast subject’s time was subtracted from the
slow subject’s time to yield a “difference” score. As shown in Table 1,
the sum is more highly related to the team score than is either the fast
or the slow subject’s individual score. Even so, the difference score is
significantly related to the team score, indicating that the greater the
difference between the subjects the longer it takes them as a team to
complete the puzzles. This relationship is to be expected, since the
greater the difference between the two subjects the larger is the sum of
their performance (Table 1).

That the slow and fast subjects’ times are not independent predictors of
the team time is indicated by the significant positive correlation between
their individual scores. As Comrey (1953) points out, this correlation can
be ascribed to the fact that the fast subject’s score, the smaller
numerical value, cannot be greater than the slow subject’s score, the
larger numerical value. This imposes a systematic linkage between the
individual scores of the pair, thus producing an artifactual correlation.
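
The artifact described above, a positive correlation that arises merely from labeling the smaller score of each pair "fast" and the larger "slow", can be illustrated with a small simulation. This sketch pairs two scores drawn independently at random, so any correlation between the sorted scores comes from the sorting alone. The normal distribution, its mean and standard deviation, and the function names are assumptions chosen for illustration; they are not taken from the study.

```python
import random
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def sorted_pair_correlation(n_pairs=10000, mean=375.0, sd=120.0, seed=0):
    """Correlate the smaller with the larger score of independent pairs."""
    rng = random.Random(seed)
    fast, slow = [], []
    for _ in range(n_pairs):
        a = rng.gauss(mean, sd)  # partner scores are drawn independently
        b = rng.gauss(mean, sd)
        fast.append(min(a, b))   # the smaller time is labeled "fast"
        slow.append(max(a, b))   # the larger time is labeled "slow"
    return pearson_r(fast, slow)


artifact_r = sorted_pair_correlation()
```

For independent normal scores the expected value of this correlation is 1/(pi - 1), or roughly .47, even though the partners' scores are unrelated before sorting.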


As may be seen in Table 1, the scores of the fast subjects bear a low
negative correlation with the difference scores. Such a relationship is
not surprising when it is recalled that all scores are time scores. This
negative correlation means that the poorer the fast subject’s score the
less difference there is between his score and that of his partner. The
high positive correlation between the slow subject’s scores and the
difference scores in part reflects the fact that the variance of the
scores of the slow subjects is greater than the variance of the fast
subjects. This, too, is a result of placing the fast members of the pairs
in one category and their partners in the other. While a slow subject’s
time score could not be less than his partner’s, potentially it could be
much larger. Thus, it is seen that the greater the time score of the slow
subject, the greater on the average is the difference between the fast and
the slow subjects’ scores. In other words, the poorer the slow subject is,
the less similar to his partner is he likely to be.

The relationships between the index of team efficiency, D(o-p), and all
other variables as shown in Table 1 are quite low. While the negative
correlations between D(o-p) and the scores of the fast subjects, the slow
subjects, and the sum of the scores of both subjects are not significant,
they suggest that the more highly proficient a person is, that is, the
smaller is his time score, the more difficult it is for him to improve his
or his partner’s performance in a team situation. The significant positive
correlation between team time and D(o-p) scores indicates that the better
the team performance, that is, the less time it takes the team to solve
the puzzles, the more likely it is to perform better than the level
predicted on the basis of individual proficiency. This positive
correlation is not contradictory to the negative correlations between
D(o-p) and individual time scores. It merely emphasizes the fact that fast
teams are fast in part because the members are cooperating and interacting
effectively, and not solely because the members of the team happen to be
highly proficient individually.

The scores and measures of individual proficiency can be combined in at
least two different ways in the prediction of team performance. First, the
individual scores of the fast and slow subjects in each pair can be used
in a multiple regression equation. Secondly, the average or sum of the
individual fast and slow scores can be used as one variable and the
difference between these scores as a second variable, in another multiple
regression equation. The first such equation would deal with the members
of the group as individuals, and the second would deal with them as a
group. These two multiple regression equations are presented in Table 2
where both team performance and team efficiency (D(o-p)) are the predicted
variables.


TABLE 2

COEFFICIENTS OF MULTIPLE CORRELATION AND BETA WEIGHTS FOR PREDICTION OF
TEAM PERFORMANCE AND TEAM EFFICIENCY SCORES

                    Team             Team efficiency
Predictors          performance      D(o-p) score

Individual          R = .847         R = .236
  Fast              β = .594         β = -.193
  Slow              β = .459         β = [illegible]
Group               R = .850         R = .236
  Sum               β = 1.080        β = [illegible]
  Difference        β = -.425        β = [illegible]


Table 2 shows first that the multiple coefficients of correlation using
individual scores to predict team performance and team efficiency are only
slightly higher than the comparable first-order coefficients presented in
Table 1. The multiple coefficients of correlation are .847 for team
performance and .236 for team efficiency, while the corresponding
first-order coefficients using the sum of individual scores (Table 1) are
.790 and -.200. The beta coefficients show that the proficiency of the
fast member contributes somewhat more to the prediction of team
performance and team efficiency than does the proficiency of the slow
member in the multiple regression equations.

The multiple coefficients of correlation in Table 2 that utilize group
characteristics, the sum and difference scores, are the same as those that
involve individual characteristics. The former coefficients are .850 and
.236 and the latter are .847 and .236. The beta weights in the
correlations involving group characteristics show that the differences
between the members in their individual proficiency contribute less to the
multiple predictions than do the averages or sums of the individual
scores. With regard to the use of the difference scores in the prediction
of the two dependent variables in Table 2, it is especially interesting to
note that a relatively large difference between the individuals in their
performance as individuals will mean relatively good team performance but
relatively poor team efficiency. The situation with regard to the sums,
however, is exactly the reverse. Large sums indicate relatively poor team
performance but relatively good team efficiency.
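
The article does not say how its beta weights were computed. The sketch below uses the standard textbook formulas for standardized regression weights with two predictors, which is an assumption about the method and not a statement of the authors' procedure. The inputs are the three correlations printed in Table 1 for the sum and difference scores (sum with team performance .790, difference with team performance .316, sum with difference .685); the function name is hypothetical.

```python
from math import sqrt


def two_predictor_regression(r1y, r2y, r12):
    """Standardized weights and multiple R for two correlated predictors.

    r1y, r2y: correlations of predictors 1 and 2 with the criterion.
    r12: correlation between the two predictors.
    """
    denom = 1.0 - r12 ** 2
    beta1 = (r1y - r2y * r12) / denom
    beta2 = (r2y - r1y * r12) / denom
    multiple_r = sqrt(beta1 * r1y + beta2 * r2y)
    return beta1, beta2, multiple_r


# Predictor 1 = sum score, predictor 2 = difference score,
# criterion = team performance (correlations as printed in Table 1).
beta_sum, beta_diff, multiple_r = two_predictor_regression(
    r1y=0.790, r2y=0.316, r12=0.685
)
```

Run on these three-decimal correlations, the formulas give values that agree, up to rounding, with the weights and multiple correlation reported in Table 2 for the sum and difference predictors. The negative weight for the difference score arises even though its first-order correlation with team time is positive, because the difference is highly correlated with the sum.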


DISCUSSION 


If the questions raised at the beginning of 
this article are now reconsidered, it is pos- 
sible to formulate some answers based on the 
obtained results in the present study. First, 
although it was not the primary aim of this 
investigation, the results support those ob- 
tained in other studies that team performance 
is not as good as performance that would be 
predicted from an additive combination of in- 
dividual scores. The task of working together 
to produce a completed jigsaw puzzle is thus 
one of a variety of tasks that seems to pro- 
duce inhibitory effects when two individuals 
are performing together as a team compared 
to when they are working apart as two sepa- 
rate individuals. 

In answer to the second question of how well team performance can be
predicted from a knowledge of the proficiency of individual members of the
team, this study showed that relatively good prediction can be obtained
from such information. These results are in accordance with those obtained
previously by Comrey and his associates on certain of their tasks.
However, if the dependent variable is team efficiency (defined as the
relation of observed to predicted behavior) rather than team performance,
the results obtained in the present investigation show that relatively
poor prediction is obtained from measures of individual proficiency. In
fact, in the present study there were no variables (neither separate
measures of individual proficiency nor their various combinations) that
showed any very high correlation with team efficiency. Since this is a
measure of team behavior that has not been used in previous similar
studies, it is difficult to ascribe reasons for the low predictability of
team efficiency or to define those conditions under which such efficiency
could be more highly predicted.


A third question posed in the introduction concerned the relative
predictability of team performance by the more proficient member of each
team vs. the less proficient member. The findings of the present study
show that the scores of the former type of individual, the more proficient
member of the team, were slightly more predictive of both team performance
and team efficiency. The fact that team performance was somewhat more
predictable from the faster member’s individual score is in line with
Comrey’s findings for his cognitive task situation (Comrey & Staats,
1955), but differs from his findings for manual dexterity task situations.
An analysis of the procedures used by Comrey in his experiments and those
used in the present investigation indicates that the major factor
determining whether the faster or slower member’s individual score will be
more predictive of team performance is not a manual task vs. cognitive
task difference, but rather the amount and type of interaction allowed
between individuals when performing as a team. For example, in both
Comrey’s cognitive task procedure and the procedure used here, the faster
member was allowed to do more than 50% of the work if he wished; in
Comrey’s manual dexterity task situations, however, the faster member had
to proceed alternately with the slower member, and thus could use his
speed on only 50% of the task. Another example of the role of the
task-determined amount and type of interaction is that in both Comrey’s
cognitive task and the present task situation the performance acts of one
individual provide a variety of cues to the partner that may influence the
speed of his performance. As an illustration, the placement of a key piece
in the partially completed jigsaw puzzle will give cues as to which two or
three other pieces can be added next. Likewise, in the crossword puzzle
task, the completion of a vertical word will provide several cues to the
partner assigned to the horizontal words. On the other hand, in a task
situation where there is simple alternation of performance acts and rigid
task assignments, the only cue provided by a single act is the cue for the
next specified one to take place.

In the present study, predictions of team performance and team efficiency
were computed not only from scores of fast and slow individuals, but also
from combination scores, namely, sums and differences of the two
individual scores for each team. The sum, of course, significantly
predicted team performance, but the difference also was a significant
predictor of this criterion. This latter finding regarding the prediction
provided by the difference scores is in contrast with Comrey’s findings in
his manual dexterity task situations. However, even in the present study,
the difference scores still offered considerably less prediction of team
performance than did the sum scores. Both types of combinations had low
correlations with the team efficiency measure, although again, the sum
scores produced somewhat higher predictability.



SUMMARY 


The present investigation examined the relationship between the
proficiency of subjects solving a cognitive-type task individually and in
teams of two persons. It was found that team work was on the average less
productive than the sum of individual performance. Team performance was
found to be better predicted by the individual proficiency of the better
member than by the proficiency of the poorer member. A simple additive
combination of the individual scores of both members of the teams gave a
good prediction of team performance, and this prediction was only slightly
increased when the individual proficiencies of the two members were
optimally weighted in a multiple regression equation. The more similar the
two members of a team were in individual proficiency, the more likely they
were to form a proficient and effective team. The extent to which the two
members of a team facilitated or interfered with each other when working
together was predicted quite poorly from measures of individual
performance. Several of the above results were discussed in terms of the
amount and type of interaction allowed between the two individuals when
performing together as a team.


REFERENCES 


Comrey, A. L. Group performance in a manual dexterity task. J. appl.
Psychol., 1953, [volume and pages illegible].

Comrey, A. L., & Deskin, G. Further results on group manual dexterity in
men. J. appl. Psychol., 1954, 38, 116-118. (a)

Comrey, A. L., & Deskin, G. Group manual dexterity in women. J. appl.
Psychol., 1954, 38, 178-180. (b)

Comrey, A. L., & Staats, Carolyn K. Group performance in a cognitive task.
J. appl. Psychol., 1955, 39, 354-356.

Horst, P. Estimating total test reliability from parts of unequal length.
Educ. psychol. Measmt., 1951, 11, 368-371.

Husband, R. W. Cooperative versus solitary problem solution. J. soc.
Psychol., 1940, 11, 405-409.

Lorge, I., Fox, D., Davitz, J., & Brenner, M. A survey of studies
contrasting the quality of group performance and individual performance,
1920-1957. Psychol. Bull., 1958, 55, 337-372.

Marquart, Dorothy I. Group problem solving. J. soc. Psychol., 1955, 41,
103-113.

Shaw, M. E. Comparison of individuals and small groups in the rational
solution of complex problems. Amer. J. Psychol., 1932, 44, 491-504.

Taylor, D. W., & Faust, W. L. Twenty questions: Efficiency in problem
solving as a function of size of group. J. exp. Psychol., 1952, 44,
360-368.

Watson, G. B. Do groups think more efficiently than individuals? J.
abnorm. soc. Psychol., 1928, 23, 328-336.


(Received February 3, 1961) 




























